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PLANT HAVING ALTERED ENVIRONMENTAL STRESS TOLERANCE 

FIELD OF THE INVENTION 

The present invention relates to the regulatory response of plants to 
5 environmental stresses such as cold and to drought. More specifically, the 
present invention relates to genes which regulate the response of a plant to 
environmental stresses such as cold or drought and their use to enhance the 
stress tolerance of recombinant plants into which these genes are introduced, 

10 BACKGROUND OF THE INVENTION 

Environmental factors serve as cues to trigger a number of specific 
changes in plant growth and development. One such factor is low temperature. 
Prominent examples of cold-regulated processes include cold acclimation, the 
15 increase in freezing tolerance that occurs in response to low non-freezing 

temperatures (Guy. C. L., Annu. Rev. Plant Physiol. Plant Mol. Biol. 41:187-223 
(1990)); vernalization, the shortening of time to flowering induced by low 
temperature (Lang, A., in Encyclopedia of Plant Physiology, Vol. 15-1, ed. 
Ruhland, W. (Springer, Berlin), pp. 1489-1536 (1965)); and stratification, the 
20 breaking of seed dormancy by low temperature (Berry, J. A. and J. K. Raison, 
in Encyclopedia of Plant Physiology, Vol. 12A, eds. Lange, O. L., Nobel. P. C. 
Osmond, C. B. and Ziegler, H. (Springer. Beriin), pp. 277-338 (1981)). Due to 
the fundamental nature and agronomic importance of these processes, there is 
interest in understanding how plants sense and respond to low temperature, 
25 One approach being taken is to determine the signal transduction pathways 
and regulatory mechanisms involved in cold-regulated gene expression. 

Strong evidence exists for calcium having a role in low temperature 
signal transduction and regulation of at least some COR (cold-regulated) 
genes. Dhindsa and colleagues (Monroy, A. F., et al, Plant Physiol. 102:1227- 
30 1235 (1993); Monroy, A. F., and R. S., The Plant Cell, 7:321-331 (1995)) have 
shown that, in alfalfa, calcium chelators and calcium channel blockers prevent 
low temperature induction of COR genes and that calcium ionophores and 
calcium channel agonists induce expression of COR genes at normal growth 

1 
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temperatures. Similarly, Knight et al (The Plant Cell 8:489-503 (1996)) have 
shown that cold-induced expression of the Arabidopsis thaliana COR gene 
KIN1 is inhibited by calcium chelators and calcium channel blockers. These 
results suggest that low temperature triggers an influx of extracellular calcium 

5 that activates a signal transduction pathway that induces the expression of 
COR genes. Consistent with this notion is the finding that low temperature 
evokes transient increases in cytosolic calcium levels in plants (Knight, M. R. et 
at, Nature 352:524-526 (1991); Knight, H., et al., The Plant Cell 8:489-503 
(1996)). In addition, low temperatures have been shown to stimulate the 

10 activity of mechanosensitive calcium-selective cation channels in plants (Ding, 
J. P. and B. G. Pickard, Plant J. 3:713-720 (1993)). 

Recent efforts have led to the identification of a c/s-acting cold- 
regulatory element in plants, the C-repeat/DRE (Yamaguchi-Shinozaki, et al.. 
The Plant Cell 6:251-264 (1994); Baker. S. S., et al., Plant. Mol. Biol. 24:701- 

15 713 (1994); Jiang, C. et al., Plant Mol. Biol. 30:679-684 (1996)). The element, 
which has a 5 base pair core sequence for CCGAC, is present once to multiple 
times in all plant cold-regutated promoters that have been described to date; 
these include the promoters of the C0R15a (Baker, S. S., et al, Plant. Mol. 
Biol. 24:701-713 (1994)), COR78/RD29A (Horvath, D. P., et al. Plant Physiol. 

20 103:1047-1053 (1993); Yamaguchi-Shinozaki, K., et al., The Plant Cell 6:251- 
264 (1994)). COR6,6 (Wang, H., et al.. Plant Mol. Biol. 28:605-617 (1995)) and 
KIN1 (Wang. H.. et al. Plant Mol. Biol. 28:605-617 (1995)) genes of Arabidopsis 
and the BN115 gene of Brassica napus (White, T. C, et al, Plant Physiol. 
106:917-928 (1994)). Deletion analysis of the Arabidopsis COR15a gene 

25 suggested that the CCGAC sequence, designated the C-repeat, might be part 
of a c/s-acting cold-regulatory element (Baker, S. S., et al., Plant Mol. Biol. 
24:701-713 (1994)). That this was the case was first demonstrated by 
Yamaguchi-Shinozaki and Shinozaki (Yamaguchi-Shinozaki, K., et al., The 
Plant Cell 6:251-264 (1994)) who showed that two of the C-repeat sequences 

30 present in the promoter of COR78/RD29A induced cold-regulated gene 

expression when fused to a reporter gene. It was also found that these two 
elements stimulate transcription in response to dehydration and high salinity 
and thus, was designated the DRE (dehydration, low temperature and high salt 
regulatory element). Recent studies by Jiang et al (Jiang, C, et al., Plant Mol. 
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Biol. 30:679-684 (1996)) indicate that the C-repeats (referred to as low 
temperature response elements) present in the promoter of the B. napus 
BN115 gene also impart cold-regulated gene expression. 

U.S. Patents Nos. 5,296,462 and 5,356,816 to Thomashow describe 
5 the genes encoding the proteins involved in cold adaptation in Arabidopsis 
thaliana. In particular the DNA encoding the COR1 5 proteins is described. 
These proteins are significant in promoting cold tolerance in plants. 

A need exists for the identification of genes which regulate the 
expression of cold tolerance genes and drought tolerance genes. A further 
10 need exists for DNA constructs useful for introducing these regulatory genes 
into a plant in order to cause the plant to begin expressing or enhance their 
expression of native or non-native cold tolerance genes and drought tolerance 
genes. These and other needs are provided by the present invention. 



3 
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SUMMARY OF THE INVENTION 

DNA in isolated form is provided which includes a sequence encoding a 
binding protein capable of selectively binding to*a DNA regulatory sequence 
5 which regulates expression of one or more environmental stress tolerance 
genes in a plant. The binding protein is preferably capable of regulating 
expression of one or more environmental stress tolerance genes in a plant by 
selectively binding to a DNA regulatory sequence which regulates the one or 
more environmental stress tolerance genes. In one embodiment, the binding 

10 protein is a non-naturally occumng protein formed by combining an amino acid 
sequence capable of binding to a CCG regulatory sequence, preferably a 
CCGAC regulatory sequence with an amino acid sequence which forms a 
transcription activation region which regulates expression of one or more 
environmental stress tolerance genes In a plant by regulating expression of one 

15 or more environmental stress tolerance genes when the binding protein binds 
to the regulatory region. 

DNA in isolated form is also provided which includes a promoter and 
the sequence encoding the binding protein. In one variation, the promoter 
causes expression of the binding protein in a manner which is different than 

20 how the binding protein is expressed in its native state. For example, the 
promoter may increase the level at which the binding protein is expressed, 
express the binding protein without being induced by an environmental stress 
and/or express the binding protein in response to a different form or degree of 
environmental stress than would otherwise be needed to induce expression of 

25 the binding protein. The promoter may also be inducible by an exogenous 

agent. The promoter can also be selected with regard to the type or types of 
plant tissues that the binding protein will be expressed as well as when in the 
plant's life the promoter will function to regulate expression of the binding 
protein. 

30 A nucleic acid construct capable of transforming a plant is also provided 

which includes a sequence encoding a binding protein capable of selectively 
binding to a DNA regulatory sequence which regulates expression of one or 
more environmental stress tolerance genes in a plant. The binding protein is 
preferably capable of regulating expression of one or more environmental 

4 
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Stress tolerance genes in a plant by selectively binding to a DNA regulatory 
sequence which regulates the one or more environmental stress tolerance 
genes. The nucleic acid construct may be an RNA or DNA construct. 
Examples of types of constructs include, but are not limited ttf DNA and RNA 
5 viral vectors and plasmids. 

A nucleic acid construct capable of transforming a plant is also provided 
which includes a sequence which when transformed into a plant expresses a 
binding protein capable of selectively binding to a DNA regulatory sequence 
which regulates one or more environmental stress tolerance genes in the plant. 

10 The binding protein preferably regulates expression of one or more 

environmental stress tolerance genes in the plant by selectively binding to a 
DNA regulatory sequence which regulates the one or more environmental 
stress tolerance genes. 

In one variation of the above constructs, the construct also includes a 

15 promoter which regulates expression of the binding protein encoding 

sequence. The promoter may optionally be homologous or heterologous 
relative to the binding protein encoding sequence. The promoter and binding 
protein encoding sequence may also optionally be native to the same or a 
different plant species. In one variation, the promoter causes expression of the 

20 binding protein in a manner which is different than how the binding protein is 
expressed in its native state. For example, the promoter may increase the 
level at which the binding protein is expressed, express the binding protein 
without being induced by an environmental stress and/or express the binding 
protein in response to a different form or degree of environmental stress than 

25 would otherwise be needed to induce expression of the binding protein. The 
promoter may also be inducible by an exogenous agent. The promoter can 
also be selected with regard to the type or types of plant tissues that the 
binding protein will be expressed as well as when in the plant's life the 
promoter will function to regulate expression of the binding protein. For 

30 example, flower-, fruit- and seed-specific promoters can be used to regulate the 
expression of the binding protein in these tissues of the plant, especially when 
sudden frosts strike in eariy spring and late fall. 

A binding protein in isolated form is also provided which is capable of 
selectively binding to a DNA regulatory sequence which regulates expression 

5 
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of one or more environmental stress tolerance genes in a plant. The binding 
protein is preferably capable of regulating expression of on or more 
environmental stress tolerance genes in the plant by selectively binding to a 
DNA regulatory sequence which regulates the one or more environmental 
stress tolerance genes. 

A recombinant binding protein expressed within a plant is also provided 
which is capable of selectively binding to a DNA regulatory sequence in the 
plant which regulates expression of one or more environmental stress 
tolerance genes in the plant. The recombinant binding protein is preferably 
capable of regulating expression of one or more environmental stress tolerance 
genes in the plant by selectively binding to a DNA regulatory sequence which 
regulates the one or more environmental stress tolerance genes. The 
recombinant binding protein may be native or non-native to the plant. Further, 
the recombinant binding protein may be homologous or heterologous relative to 
the DNA binding protein present in the plant in which the binding protein is 
expressed. 

A transformed cell of an organism is also provided which includes a 
recombinant sequence encoding a binding protein capable of selectively 
binding to a DNA regulatory sequence which regulates expression of one or 
more environmental stress tolerance genes in a plant. The binding protein is 
preferably capable of regulating expression of one or more environmental 
stress tolerance genes in a plant by selectively binding to a DNA regulatory 
sequence which regulates the one or more environmental stress tolerance 
genes. The transformed cell may be a unicellular organism such as a 
bacterium, yeast or virus, or from a multicellular organism such as a fungus or 
a plant. 

A transformed cell is also provided which includes a promoter and a 
sequence encoding a binding protein where at least one of the promoter and 
sequence under regulatory control of the promoter is recombinant. Optionally, 
one or both of the promoter and sequence under regulatory control of the 
promoter is not native to the cell. In one variation, the promoter causes 
expression of the binding protein in a manner which is different than how the 
binding protein is expressed in its native state. For example, the promoter may 
increase the level at which the binding protein is expressed, express the 
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binding protein without being induced by an environmental stress and/or 
express the binding protein in response to a different form or degree of 
environmental stress than would othenwise be needed to induce expression of 
the binding protein. The promoter may also be inducible by an exogenous 
5 agent. The promoter can also be selected with regard to the type or types of 
plant tissues that the binding protein will be expressed as well as when in the 
plant's life the promoter will function to regulate expression of the binding 
protein. 

A transformed cell is also provided which includes a recombinant 

10 binding protein expressed within the cell which is capable of selectively binding 
to a DNA regulatory sequence in the plant which regulates expression of one or 
more environmental stress tolerance genes in the plant. The binding protein is 
preferably capable of regulating expression of one or more environmental 
stress tolerance genes in the plant by selectively binding to a DNA regulatory 

15 sequence which regulates the one or more environmental stress tolerance 
genes. The binding protein may be native or non-native to the cell. 

A transformed plant with modified environmental stress tolerance gene 
expression is also provided. In one embodiment, the transformed plant 
includes one or more environmental stress tolerance genes; a DNA regulatory 

20 sequence which regulates expression of the one or more environmental stress 
tolerance genes; and a recombinant sequence encoding a binding protein 
capable of selectively binding to the DNA regulatory sequence. 

In another embodiment, the transformed plant includes one or more 
environmental stress tolerance genes; a DNA regulatory sequence which 

25 regulates expression of the one or more environmental stress tolerance genes; 
a sequence encoding a binding protein capable of selectively binding to the 
DNA regulatory sequence; and a recombinant promoter which regulates 
expression of the sequence encoding the binding protein. 

In yet another embodiment, the transformed plant includes one or more 

30 environmental stress tolerance genes; a recombinant DNA regulatory 

sequence which regulates expression of the one or more environmental stress 
tolerance genes; and a sequence encoding a binding protein capable of 
selectively binding to the DNA regulatory sequence. 
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In yet another embodiment, the transformed piant includes at least one 
recombinant environmental stress tolerance gene; a DNA regulatory sequence 
which regulates expression of the at least one environmental stress tolerance 
gene; and a sequence encoding a binding protein capable of selectively 
binding to the DNA regulatory sequence. 

In yet another embodiment, the transformed piant includes at least one 
recombinant environmental stress tolerance gene; a DNA regulatory sequence 
which regulates expression of the at least one environmental stress tolerance 
gene; and a recombinant binding protein expressed by the plant which is 
capable of selectively binding to the DNA regulatory sequence. 

A method for altering an environmental stress tolerance of a plant is 
also provided. In one embodiment, the method includes transforming a plant 
with at least one copy of a gene encoding a binding protein capable of binding 
to a DNA regulatory sequence which regulates one or more environmental 
stress tolerance genes in the plant; expressing the binding protein encoded by 
the gene; and stimulating expression of at least one environmental stress 
tolerance gene through binding of the binding protein to the DNA regulatory 
sequence. 

In another embodiment, the method includes transforming a plant with a 
promoter which regulates expression of at least one copy of a gene encoding a 
binding protein capable of binding to a DNA regulatory sequence which 
regulates one or more environmental stress tolerance genes in the plant or in 
specific type or types of plant tissues; expressing the binding protein encoded 
by the gene; and stimulating expression of at least one environmental stress 
tolerance gene through binding of the binding protein to the DNA regulatory 
sequence. 

In another embodiment, the method includes transforming a plant with 
one or more environmental stress tolerance genes whose expression is 
regulated by a DNA regulatory sequence; and expressing a binding protein 
capable of binding to the DNA regulatory sequence and activating expression 
of the one or more environmental stress tolerance genes. 

According to any one of the above embodiments of the present 
invention, the binding protein may optionally be selected such that it selectively 
binds to a member of a class of DNA regulatory sequences which includes the 

8 
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subsequence CCG or more particularly one of the following subsequences: 
CCGAA, CCGAT, CCGAC, CCGAG, CCGTA, CCGTT. CCGTC, CCGTG. 
CCGCA, CCGCT, CCGCG, CCGCC, CCGGA, CCGGT. CCGGC, CCGGG, 
AACCG. ATCCG, ACCCG, AGCCG, TACCG. TTCCG, TCCCG, TGCCG. 
CACCG CCCG, GACCG, GTCCG. GCCCG, GGCCG, ACCGA, ACCGT. 
ACCGC, ACCGG, TCCGA, TCCGT, TCCGC, TCCGG, CCCGA. CCCGT, 
CCCGC, CCCGG, GCCGA. GCCGT, GCCGC, and GCCGG. The binding 
protein may also be selected such that the binding protein includes an AP2 
domain. 

In each of the above embodiments, the level of expression of the 
binding protein may be the same or different than the level of expression of the 
binding protein in its native state. Expression of the binding protein in the 
transformed cell may be regulated by a recombinant promoter which may have 
the effect of increasing the level at which the binding protein is expressed, 
expressing the binding protein without being induced by an environmental 
stress and/or expressing the binding protein in response to a different form or 
degree of environmental stress than is othenA/ise needed to induce expression 
of the binding protein. Expression may also be induced by an exogenous 
agent. Expression may also be limited to selected types of plant tissues or 
selected periods in the plant's life based on which promoter is used. By 
selecting in what tissues and when in a plant's life the binding protein is 
expressed, in combination with the selecting how the binding protein is 
expressed (level of expression and/or type of environmental or chemical 
induction), an incredible range of control over the environmental stress 
responses of a plant can be achieved by the present invention. 

In each of the above embodiments, the binding protein comprises an 
amino acid sequence which is capable of binding to a DNA regulatory 
sequence which regulates one or more environmental stress tolerance genes. 
In a preferred embodiment, the binding protein further comprises a transcription 
activation region which acts in concert with the binding sequence to regulate 
expression of one or more environmental stress tolerance genes in the plant by 
regulating expression of one or more environmental stress tolerance genes. 
The environmental stress tolerance gene, DNA regulatory sequence, sequence 
encoding the binding sequence, and the sequence encoding the transcription 
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activation region may each independently be native or non-native to the plant 
and may each independently be homologous or heterologous relative to each 
other. 

Optionally, the binding protein satisfies one or more of the following 
5 requirements: 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence sufficiently homologous to any one of the consensus 
sequences shown in Figures 19A, 19B, or 19C that the binding protein is 
capable of binding to a CCG regulatory sequence, preferably a CCGAC 
10 regulatory sequence; 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence shown in Figures 19A, 19B or 19C; 

the binding protein comprises an AP2 domain which comprises the 
amino acid residues shown in Figures 19D or 19E; 
15 the binding protein comprises an AP2 domain which is sufficiently 

homologous to at least one of the AP2 domains shown in the application such 
that it is capable of binding to a CCG regulatory sequence, preferably a 
CCGAC regulatory sequence; 

the binding protein comprises one of the AP2 domain sequences shown 
20 in this application, including, but not limited to SEQ. I.D. Nos. 2, 13, 15, 39, 41, 
43. 45, 47, 49, 51, 53. 55, 57, 59, 61. 63. 65. 67, 69. 71. 73, 75. 77, 79, 81, 83, 
85, 87, 89. 91, 93, and 95; 

the binding protein comprises a sequence which comprises one of the 
amino terminus domains shown In Figure 20 (it is noted that the sequence 
25 need not be at the amino terminus of the binding protein); 

the binding protein comprises the consensus sequence for the amino 
terminus domains shown in Figure 20, (it is noted that the sequence need not 
be at the amino terminus of the binding protein); 

the binding protein comprises a sequence which comprises one of the 
30 carboxy terminus domains shown in Figure 21 A (it is noted that the sequence 
need not be at the carboxy terminus of the binding protein); 

the binding protein comprises the consensus sequence for the carboxy 
terminus domains shown in Figure 21A (it is noted that the sequence need not 
be at the carboxy terminus of the binding protein); and 

10 
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the binding protein comprises the consensus sequence for the carboxy 
terminus domains shown in Figure 21 B (it Is noted that the sequence need not 
be at the carboxy terminus of the binding protein). 

The amino acid sequence encoding the binding protein may be a 
5 naturally occurring sequence such as the ones shown in SEQ. ID. Nos. 2. 13. 
15. 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65. 67. 69, 71, 73, 75, 77, 
79, 81. 83, 85, 87, 89, 91, 93, and 95 or may be a non-naturally occurring 
sequence. It is noted, however, that binding proteins according to the present 
invention are intended to encompass non-naturally occurring sequences which 

10 are derivatives of the classes of binding proteins taught herein. For example, 
additional binding proteins may be constructed using one of the AP2 domains 
taught herein or the consensus sequence of these AP2 domains. It may be 
desirable to include with the AP2 domain a transcription activation region. The 
transcription activation region may be native to the plant or non-native to the 

15 plant in which the binding protein will be used. For example, the sequence may 
include a subsequence which encodes a binding domain for the DNA 
regulatory sequence fused to a transcription activating region, such as the 
transcription activating region of VP16 or GAL4. Optionally, one can include in 
the binding protein one of the amino terminus domains, the consensus 

20 sequence for the amino terminus domain, one of the carboxy terminus domains 
and/or the consensus sequence for the carboxy terminus domains. It is noted 
that the amino terminus domain may be positioned away from the amino 
terminus of the new binding protein and the carboxy terminus domain may be 
positioned away from the carboxy temninus of the new binding protein. 

25 Optionally, the binding protein can be viewed as comprising one of the 

amino terminus domains, the consensus sequence for the amino terminus 
domain, one of the carboxy terminus domains and/or the consensus sequence 
for the carboxy terminus domains. It is noted that the amino terminus domain 
may be positioned away from the amino terminus of the new binding protein 

30 and the carboxy terminus domain may be positioned away from the carboxy 
terminus of the new binding protein. 

A method is also provided for identifying from a cDNA library of at least 
a portion of a plant genome a gene sequence encoding a protein capable of 

11 



BNSDOCID: <WO ,9938977A2_L> 



wo 99/38977 



PCTAJS99/01895 



binding to a target DNA regulatory sequence. In one embodiment, the method 
comprises 

taking a microorganism which includes a target DNA regulatory 
sequence for one or more environmental stress tolerance genes, a 
5 transcription activator for activating expression of a reporter gene, and a 
reporter gene whose expression is activated by a protein which includes a 
binding domain capable of binding to the target DNA regulatory sequence and 
an activation domain capable of activating the transcription activator; 

fusing sequences from a cDNA library of at least a portion of a plant 
10 genome to a sequence which encodes a functional activation domain in the 
microorganism; 

introducing the fused sequences into the microorganism; and 
selecting microorganisms which express the reporter gene, expression 
of the reporter gene indicating expression of a fusion protein which includes a 
1 5 binding domain for the target DNA regulatory sequence and the activation 
domain; and 

identifying the gene sequence from the cDNA library introduced Into the 
microorganism. 

The target DNA regulatory sequence may optionally include the subsequence 
20 CCG or the subsequence CCGAC. This embodiment of the invention also 

relates to DNA in substantially isolated form, nucleic acid constructs capable of 
transforming a plant, cells, and transformed plants which include a gene 
sequence identified by this method. 

While the present invention is described with regard to the use of 
25 binding proteins which can bind to a DNA regulatory sequence that regulates 
environmental stress tolerance genes in a plant, it is noted that these same 
binding proteins can also be used to regulate genes other than environmental 
stress tolerance genes by placing these other genes under the regulatory 
control of the DNA regulatory sequence. For example, protein kinases that 
30 induce cold and drought inducible genes can be regulated by placing a protein 
kinase gene under the control of a promoter whose expression is regulated by 
the DNA regulatory sequence. PCT/US97/23019 (IntI Publication Number WO 
98/26045) describes protein kinases that when constitutively expressed, induce 
cold and drought inducible genes. The ATCDPKIa and the ATCDPK1 
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constitutive protein i<inase coding regions (PCT/US97/23019) can be isolat d 
by PGR and inserted into the drought and cold inducible promoters described 
in Example 8 by one skilled in the art. The expression of these ATCDPK1 
constitutive protein l<inase coding regions (PCT/US97/23019) from the drought 
and cold inducible promoters will increase the drought and cold tolerance of 
plants and should be synergistic with the drought and cold tolerance induced 
by CBF expression under inducible promoters. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A and IB show how the yeast reporter strains were 
constructed. 

Figure 1A is a schematic diagram showing the screening strategy. 
Figure 1 B is a chart showing activity of the "positive" cDNA clones in 
yeast reporter strains. 

Figures 2A, 2B, 2C and 20 provide an analysis of the pACT-1 1 cDNA 

clone. 

Figure 2A is a schematic drawing of the pACT-1 1 cDNA insert 
indicating the location and 5' to 3' orientation of the 24 kDa polypeptide and 25s 
rRNA sequences. 

Figure 2B is a DNA and amino acid sequence of the 24 kDa polypeptide 
(SEQ ID NO:1 and SEQ ID NO:2). 

Figure 2C is a schematic drawing indicating the relative positions of the 
potential nuclear localization signal (NLS). the AP2 domain and the acidic 
region of the 24 kDa polypeptide. 

Figure 2D is a chart showing comparison of the AP2 domain of the 24 
kDa polypeptide with that of the tobacco DNA binding protein EREBP2. 

Figure 3 is a chart showing activation of reporter genes by the 24 kDa 
polypeptide. 

Figure 4 is a photograph of an electrophoresis gel showing expression 
of the recombinant 24 kDa polypeptide in E. coli. 

Figure 5 is a photograph of a gel for shift assays indicating that CBF1 
binds to the C-repeat/DRE. 
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Figure 6 is a photograph of a southern blot analysis indicating CBF1 is 
a unique or low copy number gene. 

Figures 7A. 7B and 7C relate to CBF1 transcripts in control and cold- 
treated Arabidopsis. 

Figure 7A is a photograph of a membrane RNA isolated from 
Arabidopsis plants that were grown at 22 C or grown at 22 C and transferred to 

2.5 C for the indicated times. 

Figure 7B is a graph showing relative transcript levels of CBF1 in 
control and cold-treated plants. 

Figure 7C is a graph showing relative transcript levels of COR15a in 
control and cold-treated plants. 

Figure 8 is a Northern blot showing CBF1 and CO« transcript levels in 
RLD and transgenic Arabidopsis plants. 

Figure 9 is an immunoblot showing CORISam protein levels in RLD 
and transgenic Arabidopsis plants. 

Figures 1 0A and 1 0B are graphs showing freezing tolerance of leaves 
from RLD and transgenic Arabidopsis plants. 

Figure 1 1 is a photograph showing freezing survival of RLD and A6 
Arabidopsis plants. 

Figure 12 shows the DNA sequence for CBF2 encoding CBF2. 
Figure 13 shows the DNA sequence for CBF3 encoding CBF3. 
Figure 14 shows the amino acid alignment of proteins CBF1, CBF2 and 

CBF3. 

Figure 15 is a graph showing transcription regulation of COR genes by 
CBF1, CBF2 and CBF3 genes in yeast. 

Figure 16 shows the amino acid sequence of a canola homolog and its 
alignment to the amino acid sequence of CBF1. 

Figures 17A. 17B. 17C. 17D. 17E. 17Fand 1 7G show restriction maps 
of plasmids PMB12008, pMB12009, pMB12010, pMB12011. pM812012. 
PMB12013, and pMB12014, respectively. 

Figure 18A shows the DNA sequences for the CBF homologs from 
Brass.cajuncea, Brassica napus. Brassica oleracea. Brassica rapa, Glycine 
max, Raphanus sativus and Zea Maize. 
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Figure 18B shows the amino acid sequences (one-letter abbreviations) 
encoded by the DNA sequences (shown in FIG. 18A) for CBF homologs from 
Brassicajuncea. Brassica napus. Brassica oleracea. Brassies rapa, Glycine 
max, Raptianus sativus and Zea Maize. 

Figure 19A shows an amino acid alignment of the AP2 domains of 
several CBF proteins with the consensus sequence between the proteins 
highlighted as well as a comparison of the AP2 domains with that of the 
tobacco DNA binding protein EREBp2. 

Figure 19B shows an amino acid alignment of the AP2 domains of 
several CBF proteins including dreb2a and dreb2b with the consensus 
sequence between the proteins highlighted. 

Figure 19C shows an amino acid alignment of the AP2 domains of 
several CBF proteins including dreb2a, dreb2b. and tiny with the consensus 
sequence between the proteins highlighted. 

Figure 19D shows a difference between the consensus sequence 
shown in Figure 19A and tiny. 

Figure 19E shows a difference between the consensus sequence 
shown in Figure 19B and tiny. 

Figure 20 shows an amino acid alignment of the amino terminus of 
several CBF proteins with their consensus sequence highlighted. 

Figure 21 A and 21 B show an amino acid alignment of the carboxy 
terminus of several CBF proteins, with their consensus sequences highlighted. 

DETAILED DESCRIPTION 



The present invention relates to DNA encoding binding proteins capable 
of binding to a DNA regulatory sequence which regulates expression of one or 
more environmental stress tolerance genes in a plant. The present invention 
also relates to the binding proteins encoded by the DNA. The DNA and binding 
proteins may be native or non-native relative to the DNA regulatory sequence 
of the plant. The DNA and binding proteins may also be native or non-native 
relative to environmental stress tolerance genes of the plant which are 
regulated by the DNA regulatory sequence. 
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The present invention also relates to methods for using the DNA and 
binding proteins to regulate expression of one or more native or non-native 
environmental stress tolerance genes in a plant. These methods may include 
introducing DNA encoding a binding protein capable of binding to a DNA 
5 regulatory sequence into a plant, introducing a promoter into a plant which 
regulates expression of the binding protein, introducing a DNA regulatory 
sequence into a plant to which a binding protein can bind, and/or introducing 
one or more environmental stress tolerance genes into a plant whose 
expression is regulated by a DNA regulatory sequence. 
^° The present invention also relates to recombinant cells, plants and plant 

materials (e.g., plant tissue, seeds) into which one or more gene sequences 
encoding a binding protein have been introduced as well as cells, plants and 
plant materials within which recombinant binding proteins encoded by these 
gene sequences are expressed. By introducing a gene sequence encoding a 
15 binding protein into a plant, a binding protein can be expressed within the plant 
which regulates expression of one or more stress tolerance genes in the plant. 
Regulation of expression can include causing one or more stress tolerance 
genes to be expressed under different conditions than those genes would be in 
the plant's native state, increasing a level of expression of one or more stress 
20 tolerance genes, and/or causing the expression of one or more stress tolerance 
genes to be inducible by an exogenous agent. Expression of the binding 
protein can be under the control of a variety of promoters. For example, 
promoters can be used to overexpress the binding protein, change the 
environment conditions under which the binding protein is expressed, or enable 
25 the expression of the binding protein to be induced, for example by the addition 
of an exogenous inducing agent. Promoters can also be used to cause the 
protein to be expressed at selected times during a plant's life. Tissue-specific 
promoters can be used to cause the protein to be expressed in selected 
tissues. For example, flower-, fruit- and seed-specific promoters can be used 
to cause the protein to be selectively expressed in flowers, fruits or seeds of 
the plant. 

The present invention also relates to cells, recombinant plants and plant 
materials into which a recombinant promoter is introduced which controls a 
level of expression of one or more gene sequences encoding a binding protein. 

16 
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The one or more gene sequences may be recombinant native or non-native 
sequences or may be native, non-recombinant gene sequences whose 
expression is altered by the introduction of the recombinant promoter. 

The present invention also relates to cells, recombinant plants and plant 
materials into which a recombinant native or non-native DNA regulatory 
sequence is introduced which regulates expression of one or more native or 
non-native environmental stress tolerance genes. 

Examples of environmental stresses for which stress tolerance genes 
are known to exist include, but are not limited to, cold tolerance, dehydration 
tolerance, and salinity tolerance. As used herein, environmental stress 
tolerance genes refer to genes which function to acclimate a plant to an 
environment stress. For example, cold tolerance genes, also referred to as 
COR genes (COId Regulated), refer to genes which function to acclimate a 
plant to a cold temperature environment. These genes typically are activated 
when a plant is exposed to cold temperatures. Dehydration tolerance genes 
refer to genes which function to acclimate a plant to dehydration stress. These 
genes typically are activated in response to dehydration conditions which can 
be associated with drought or cold temperatures which cause water in the plant 
to freeze and thereby dehydrate the plant tissue. It is noted that some cold 
tolerance genes may function to provide a plant with a degree of dehydration 
tolerance and visa versa. For example, COR genes are known to also be 
activated by dehydration stress. This application is intended to encompass 
genes which regulate one or more environmental stress tolerance genes such 
as cold tolerance genes, dehydration tolerance genes, and genes which 
perform a dual function of cold and dehydration tolerance. 

One embodiment of the invention relates to a DNA sequence in isolated 
form which includes a sequence encoding a binding protein capable of 
selectively binding to a DNA regulatory sequence which regulates expression 
of one or more environmental stress tolerance genes in a plant. The binding 
protein is preferably capable of regulating expression of one or more 
environmental stress tolerance genes in a plant by selectively binding to a DNA 
regulatory sequence which regulates the one or more environmental stress 
tolerance genes. In one variation, the binding protein is a non-naturally 
occurring protein formed by combining an amino acid sequence capable of 
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binding to a CCG regulatory sequence, preferably a CCGAC regulatory 
sequence with an amino acid sequence which forms a transcription activation 
region which regulates expression of one or more environmental stress 
tolerance genes in a plant by regulating expression of one or more 
environmental stress tolerance genes when the binding protein binds to the 
regulatory region. 

The DNA sequence may exist in a variety of forms including a plasmid 
or vector and can include sequences unrelated to the gene sequence encoding 
the binding protein. For example, the DNA sequence can include a promoter 
which regulates expression of the regulatory gene. 

In one variation of this embodiment, the DNA regulatory sequence is a 
C-repeat cold and drought regulation element (C-repeat/DRE). As will be 
explained and demonstrated herein, C-repeat/DRE regulatory sequences 
appear to be conserved In plants with some degree of variability plant to plant. 
Using the teachings of the present invention. C-repeat/DRE regulatory 
sequences native to different plants can be identified as well as the native 
stress tolerance regulatory genes which encode for proteins which bind to the 
C-repeat/DRE DNA regulatory sequences. Hence, although the examples 
provided herein to describe the present invention are described with regard to 
the Arabadopsis C-repeat/DRE DNA regulatory sequence, the present 
invention is not intended to be limited to the Arabadopsis C-repeat/DRE DNA 
regulatory sequence. Rather, the Arabadopsis C-repeat/DRE DNA regulatory 
sequence is believed to be a member of a class of environmental stress 
response regulatory elements which includes the subsequence CCGAC which 
in turn is believed to be a member of a class of environmental stress response 
regulatory elements which includes the subsequence CCG. Other different 
classes of environmental stress response regulatory elements may also exist. 
The teachings of the present invention may be used to identify sequences 
which bind to these and other classes of environmental stress response 
regulatory elements once they are identified. 

In one variation of this embodiment, the gene sequence encodes a 
binding protein which selectively binds to a member of a class of DNA 
regulatory sequences which includes the subsequence CCG. In another 
variation, the gene sequence encodes a binding protein which selectively binds 
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to a member of a class of DNA regulatory sequences which includes the 
subsequence CCGAC. The CCGAC subsequence has been found to present 
in the C-repeat/DRE DNA regulatory sequences of Arabadopsis and Brassica 
and to function in Tobacco based on the ability of the C-repeat/DRE to direct 
cold and tolerance regulated gene expression. 

In yet another variation, the stress tolerance regulatory gene sequence 
encodes a binding protein which includes an AP2 domain. It is believed that a 
significant class of environmental stress tolerance regulatory genes encode for 
binding proteins with an AP2 domain capable of binding to the DNA regulatory 
sequence. The AP2 domain of the binding protein is preferably a homolog of 
the AP2 domain of one of the CBF binding proteins described herein. The 
subsequence encoding the AP2 domain is preferably a homolog of a 
subsequence of one of the CBF genes described herein which encodes an 
AP2 domain. 

In another variation, the DNA sequence encoding the binding protein 
satisfies one or more of the following requirements: 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence sufficiently homologous to any one of the consensus 
sequences shown in Figures 19A, 19B. or 19C that the binding protein is 
capable of binding to a CCG regulatory sequence, preferably a CCGAC 
regulatory sequence; 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence shown in Figures 19A, 19B or 19C; 

the binding protein comprises an AP2 domain which comprises the 
25 amino acid residues shown in Figures 19D or 19E; 

the binding protein comprises an AP2 domain which is sufficiently 
homologous to at least one of the AP2 domains shown in the application such 
that it is capable of binding to a CCG regulatory sequence, preferably a 
CCGAC regulatory sequence; 

the binding protein comprises one of the AP2 domain sequences shown 
in this application, including, but not limited to SEQ. I.D. Nos. 2, 13, 15, 39, 41, 
43. 45, 47. 49. 51. 53, 55, 57, 59. 61. 63. 65. 67. 69. 71, 73, 75'. 77, 79, 81, 83, 
85. 87, 89, 91, 93, and 95; 
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the binding protein comprises a sequence which comprises one of the 
amino terminus domains shown in Figure 20 (it is noted that the sequence 
need not be at the amino terminus of the binding protein); 

the binding protein comprises the consensus sequence for the amino 
terminus domains shown in Figure 20, (it is noted that the sequence need not 
be at the amino terminus of the binding protein); 

the binding protein comprises a sequence which comprises one of the 
carboxy terminus domains shown in Figure 21A (it is noted that the sequence 
need not be at the carboxy terminus of the binding protein); 

the binding protein comprises the consensus sequence for the carboxy 
tenninus domains shown in Figure 21A (it is noted that the sequence need not 
be at the carboxy tenninus of the binding protein); 

the binding protein comprises the consensus sequence for the carboxy 
terminus domains shown in Figure 21B (it is noted that the sequence need not 
be at the carboxy temninus of the binding protein); 

one of SEQ. i.D. Nos. 1, 12, 14, 40, 42. 44, 46, 48. 50, 52. 54, 56 58 
60, 62, 64, 66, 68. 70. 72. 74, 76, 78, 80. 82, 84. 86, 88, 90, 92. and 94; or ' 
a sequence which has substantially the same degree of homology to 
SEQ. I.D. Nos. 1, 12. 14, 40, 42, 44, 46, 48, 50, 52, 54. 56, 58, 60, 62, 64, 66, 
68, 70, 72. 74, 76. 78. 80. 82, 84, 86. 88, 90, 92. and 94 as these sequences 
have with each other. 

The present invention also relates to a method for identifying gene 
sequences from at least a portion of a plant genome which encode binding 
proteins capable of binding to a target DNA regulatory sequence which 
regulates expression of one or more stress tolerance genes in a plant. 

in one embodiment, the method includes: 

taking a microorganism which includes a target DNA regulatory 
sequence for one or more environmental stress tolerance genes, a 
transcription activator for activating expression of a reporter gene, and a 
reporter gene whose expression is activated by a protein which includes a 
binding domain capable of binding to the target DNA regulatory sequence and 
an activation domain capable of activating the transcription activator; 
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fusing sequences from a cDNA library of at least a portion of a plant 
genome to a sequence which encodes a functional activation domain in the 
microorganism; 

introducing the fused sequences into the microorganism: and 
selecting microorganisms which express the reporter gene, expression 
of the reporter gene indicating expression of a fusion protein which includes a 
binding domain for the target DNA regulatory sequence and the activation 
domain; and 

identifying the gene sequence from the cDNA library introduced into the 
microorganism. 

In one variation of the method, the target DNA regulatory sequence 
includes the subsequence CCG and in another embodiment includes the 
subsequence CCGAC. In yet another variation, the target DNA regulatory 
sequence is the C-repeat/DRE for Arabadopsis. According to the above 
method, the target DNA regulatory sequence is preferably native to the plant 
family and more preferably to the plant species from which the cDNA library is 
derived. 

In another variation of this embodiment, the cDNA library used in the 
method consists of sequences which encode for a protein having an AP2 
domain since it is believed that a significant class of genes encoding binding 
proteins for stress tolerance genes encode an AP2 domain. As will be 
explained herein, screening for DNA sequences from a plant genome which 
exhibit this functional feature has been shown to be effective for isolating gene 
sequences encoding binding proteins of the present invention. 

In another variation of this method, the sequences from the cDNA 
library are fused to a sequence which includes a selectable marker, the method 
further including the step of selecting for microorganisms expressing the 
selectable marker. 

While the above methodology of the present invention is described 
herein with regard to identifying binding protein gene sequences from 
Arabidopsis cDNA using the C-repeat/DRE regulatory sequence for 
Arabidopsis, it is noted that this methodology can be readily used to identify 
regulatory binding protein gene sequences for other plants by using a DNA 
regulatory sequence native to those plants. Alternatively, different 
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permutations of the CCG subsequence can be used as the target DNA 
regulatory sequence. 

An example of a microorganism which may be used in the above 
method is yeast. cDNA can be introduced into the microorganism by a variety 
of mechanisms including plasmids and vectors. In one particular embodiment, 
the reporter gene is beta-galactosidase. 

The present invention also relates to any DNA sequences and binding 
proteins encoded by those DNA sequences which are identified by the above 
screening method. 

The present invention also relates to a protein expressed by an 
environmental stress tolerance regulatory gene according to the present 
invention which can function in vivo in a plant to regulate expression of one or 
more environmental stress tolerance genes. 

According to one embodiment, the protein is a recombinant binding 
protein expressed by a copy of a recombinant gene which is either not native to 
the plant or is native to the plant but introduced into the plant by recombinant 
methodology. For example, one might wish to introduce one or more copies of 
a regulatory gene which is native to the plant but is under the control of a 
promoter which overexpresses the binding protein, expresses the binding 
protein independent of an environmental stress, expresses the binding protein 
at a higher level in response to the same environmental stress than would a 
plant in its native state, expresses the binding protein in response to different 
environmental stress conditions, and/or be induced to express the binding 
protein by an exogenous agent to which the plant can be exposed. 
Alternatively, one might wish to introduce one or more copies of a regulatory 
gene which is not native to the plant. For example, the non-native regulatory 
gene may be used to alter the way in which native environmental stress 
tolerance genes are regulated. Alternatively, the non-native regulatory gene 
may be used to regulate environmental stress tolerance genes which are also 
not native to the plant. The non-native regulatory gene may be used to bind to 
a DNA regulatory region which is not native to the plant. 

In another embodiment, the proteins have been isolated from a 
recombinant organism. The organism may be a microorganism (e.g.. bacteria. 
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yeast) or a multicellular organism such as a plant. In one variation, the protein 
is in substantially isolated form. 

In yet another embodiment, the protein is a native, non-recombinant 
binding protein whose expression is regulated within a plant by a recombinant 
native or non-native promoter. For example, one might wish to replace a native 
promoter with a recombinant promoter which overexpresses the binding 
protein, expresses the binding protein independent of an environmental stress, 
expresses the binding protein at a higher level in response to the same 
environmental stress than would a plant in its native state, expresses the 
binding protein in response to different environmental stress conditions, and/or 
be induced to express the binding protein by an exogenous agent to which the 
plant can be exposed. 

In one variation of the above embodiments, the protein is capable of 
selectively binding to a DNA regulatory sequence for one or more 
environmental stress tolerance genes in a plant. In another variation, the 
protein includes an AP2 domain which is capable of selectively binding to a 
DNA regulatory sequence for one or more environmental stress tolerance 
genes in a plant. One method which may be used to determine whether the 
protein binds selectively to the DNA regulatory sequence is a gel shift assay. 
The DNA regulatory sequence may optionally include a CCG subsequence, a 
CCGAC subsequence and optionally the C-repeat / ORE sequence of 
Arabadopsis. 

In another variation of the above embodiments, the binding protein 
satisfies one or more of the following requirements: 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence sufficiently homologous to any one of the consensus 
sequences shown in Figures 19A, 19B. or 19C that the binding protein is 
capable of binding to a CCG regulatory sequence, preferably a CCGAC 
regulatory sequence; 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence shown in Figures 19A, 19B or 19C; 

the binding protein comprises an AP2 domain which comprises the 
amino acid residues shown in Figures 19D or 19E; 
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the binding protein comprises an AP2 domain which is sufficiently 
homologous to at least one of the AP2 domains shown in the application such 
that it is capable of binding to a CCG regulatory sequence, preferably a 
CCGAC regulatory sequence; 

the binding protein comprises one of the AP2 domain sequences shown 
in this application, including, but not limited to SEQ. I.D. Nos. 2, 13, 15, 39, 41, 
43, 45, 47, 49, 51, 53. 55, 57, 59, 61. 63, 65, 67. 69, 71, 73, 75, 77, 79, 81', 83, 
85. 87, 89. 91. 93, and 95; 

the binding protein comprises a sequence which comprises one of the 
amino terminus domains shown in Figure 20 {it is noted that the sequence 
need not be at the amino terminus of the binding protein); 

the binding protein comprises the consensus sequence for the amino 
terminus domains shown in Figure 20. (it is noted that the sequence need not 
be at the amino terminus of the binding protein); 

the binding protein comprises a sequence which comprises one of the 
carboxy terminus domains shown in Figure 21A (it is noted that the sequence 
need not be at the carboxy terminus of the binding protein); 

the binding protein comprises the consensus sequence for the carboxy 
terminus domains shown in Figure 21 A (it is noted that the sequence need not 
be at the carboxy terminus of the binding protein); and 

the binding protein comprises the consensus sequence for the carboxy 
terminus domains shown in Figure 21B (it is noted that the sequence need not 
be at the carboxy terminus of the binding protein). 

The sequence of the binding protein may be a naturally occurring 
sequence such as the ones shown in SEQ. ID. Nos. 2. 13, 15, 39, 41, 43, 45, 
47, 49, 51, 53, 55, 57. 59. 61, 63, 65. 67, 69. 71. 73, 75, 77, 79, 81, 83, 85, 87, 
89, 91. 93. and 95 or may be a.non-naturally occurring sequence. It is noted, 
however, that binding proteins according to the present invention are intended 
to encompass non-naturally occumng sequences which are derivatives of the 
classes of binding proteins taught herein. For example, additional binding 
proteins may be constructed using one of the AP2 domains taught herein or the 
consensus sequence of these AP2 domains. It may be desirable to include 
with the AP2 domain a transcription activation region. The transcription 
activation region may be native to the plant or non-native to the plant in which 
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the binding protein will be used. For example, the sequence may include a 
subsequence which encodes a binding domain for the DNA regulatory 
sequence fused to a transcription activating region, such as the transcription 
activating region of VP16 or GAL4. Optionally, one can include in the binding 

5 protein one of the amino terminus domains, the consensus sequence for the 
amino terminus domain, one of the carboxy terminus domains and/or the 
consensus sequence for the carboxy temninus domains. It is noted that the 
amino terminus domain may be positioned away from the amino terminus of 
the new binding protein and the carboxy tenninus domain may be positioned 

0 away from the carboxy terminus of the new binding protein. 

Optionally, the binding protein can be viewed as comprising one of the 
amino terminus domains, the consensus sequence for the amino terminus 
domain, one of the carboxy terminus domains and/or the consensus sequence 
for the carboxy temiinus domains. It is noted that the amino tenninus domain 

5 may be positioned away from the amino terminus of the new binding protein 
and the carboxy terminus domain may be positioned away from the carboxy 
terminus of the new binding protein. 

In another embodiment, the binding protein is an isolated protein or a 
recombinantly produced protein which has a molecular weight of about 26 kDa 

0 as measured in an electrophoresis gel and binds to a DNA regulatory 

sequence which regulates a cold or dehydration regulated gene of Arabidopsis 
thaliana. 

The present invention also relates to DNA and RNA constructs, such as 
plasmids, vectors, and the like, which are capable of transfonning a plant. The 
constructs include a sequence which encodes a binding protein capable of 
selectively binding to a DNA regulatory sequence which regulates the one or 
more environmental stress tolerance genes. The binding protein is preferably 
able to regulate expression of one or more environmental stress tolerance 
genes in a plant by selectively binding to the DNA regulatory sequence. More 
preferably, when transformed into a plant, the sequence regulates expression 
of one or more environmental stress tolerance genes in the plant by expressing 
the binding protein. In one embodiment, the DNA construct includes a promoter 
and a regulatory gene sequence whose expression is under the control of the 
promoter. Different promoters may be used to select the degree of expression 
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or conditions under which the regulatory gene is expressed. For example, the 
promoter can be used to cause overexpression of the regulatory gene, 
expression of the regulatory gene independent of an environmental stress, 
expression of the regulatory gene at a higher level in response to the same 
environmental stress than would a plant in its native state, expression of the 
regulatory gene in response to different environmental stress conditions, and/or 
induction of expression of the regulatory gene by an exogenous agent to which 
the plant can be exposed. 

Promoters can also be used to cause the protein to be expressed at selected 
times during a plant's life. Tissue-specific promoters can be used to cause the 
protein to be expressed in selected tissues. For example, flower-, fruit- and 
seed-specific promoters can be used to cause the protein to be selectively 
expressed in flowers, fruits or seeds of the plant. 

In another embodiment, the DNA construct comprises a sequence 
which encodes: 

a binding protein comprising an AP2 domain which comprises a 
consensus sequence sufficiently homologous to any one of the consensus 
sequences shown in Figures 19A. 19B, or 19C that the binding protein is 
capable of binding to a CCG regulatory sequence, preferably a CCGAC 
regulatory sequence; 

a binding protein comprising an AP2 domain which comprises a 
consensus sequence shown in Figures 19A, 19B or 19C; 

a binding protein comprising an AP2 domain which comprises the 
amino acid residues shown in Figures 19D or 19E; 

a binding protein comprising an AP2 domain which is sufficiently 
homologous to at least one of the AP2 domains shown in the application such 
that it is capable of binding to a CCG regulatory sequence, preferably a 
CCGAC regulatory sequence; 

a binding protein comprising one of the AP2 domain sequences shown 
in this application, including, but not limited to SEQ. I.D. Nos. 2, 13, 15, 39, 4i, 
43. 45, 47. 49, 51. 53. 55, 57, 59, 61, 63, 65, 67, 69, 71, 73. 75, 77'. 79', 81'. 83', 
85. 87, 89, 91, 93, and 95; 
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a binding protein comprising a sequence which comprises one of the 
amino terminus domains shown in Figure 20 (it is noted that the sequence 
need not be at the amino terminus of the binding protein); 

a binding protein comprising the consensus sequence for the amino 
terminus domains shown in Figure 20, (it is noted that the sequence need not 
be at the amino terminus of the binding protein); 

a binding protein comprising a sequence which comprises one of the 
carboxy terminus domains shown in Figure 21A (it is noted that the sequence 
need not be at the carboxy terminus of the binding protein); 

a binding protein comprising the consensus sequence for the carboxy 
terminus domains shown in Figure 21 A (it is noted that the sequence need not 
be at the carboxy terminus of the binding protein); 

a binding protein comprising the consensus sequence for the carboxy 
terminus domains shown in Figure 21B (it is noted that the sequence need not 
15 be at the carboxy temiinus of the binding protein); 

one of SEQ. i.D. Nos. 1, 12, 14, 40, 42, 44, 46, 48. 50, 52, 54, 56, 58, 
60, 62, 64. 66, 68. 70, 72. 74. 76, 78, 80. 82, 84. 86, 88. 90. 92. and 94; or 

a sequence which has substantially the same degree of homology to 
SEQ. I.D. Nos. 1. 12. 14. 40. 42. 44, 46. 48, 50. 52, 54, 56, 58, 60, 62, 64, 66, 
68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88. 90, 92, and 94 as these sequences 
have with each other. 

The present invention also relates to plasmids pCBFI (ATCC 98063), 
pCBF2, and pCBF3. 

The present invention also relates to a recombinant microorganism, 
such as a bacterium, yeast, fungus, virus, into which at least one copy of a 
regulatory gene encoding a binding protein of the present invention has been 
introduced by a recombinant methodology. 

The present invention also relates to recombinant plants into which at 
least one copy of a regulatory gene encoding a binding protein of the present 
invention has been introduced by a recombinant methodology. The 
recombinant copy of the regulatory gene may be native or non-native to the 
plant and express a binding protein which is either native or non-native to the 
plant. Expression of the recombinant copy of the regulatory gene may be 
under the control of the promoter. The promoter may increase the level at 
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Which the regulatory gene is expressed, express the regulatory gene without 
being induced by an environmental stress and/or express the regulatory gene 
in response to a different form or degree of environmental stress than would 
othenwise be needed to induce expression of the regulatory gene. For 
example, a promoter can be used which turns on at a temperature that is 
warmer than the temperature at which the plant normally exhibits cold 
tolerance. This would enable the cold tolerance thermostat of a plant to be 
altered. Similarly, a promoter can be used which turns on at a dehydration 
condition that is wetter than the dehydration condition at which the plant 
normally exhibits dehydration tolerance. This would enable the level at which a 
plant responds to dehydration to be altered. A promoter can also be used 
which causes a higher level of expression to occur at a given environmental 
condition (e.g. temperature and/or dryness) than the plant would express in its 
native state. The promoter may also be inducible by an exogenous agent, i.e., 
express the regulatory gene in response to the presence of an agent to which 
the promoter is exposed. This would enable stress tolerance to be induced by 
applying an inducing agent to the plant. 

Selection of the promoter can also be used to determine what tissues in 
the plant express the binding protein as well as when expression occurs in the 
plant's lifecycle. By selecting a promoter which regulates in what tissues and 
when in a plant's life the promoter functions to regulate expression of the 
binding protein, in combination with the selecting how that promoter regulates 
expression (level of expression and/or type of environmental or chemical 
induction), an incredible range of control over the environmental stress 
responses of a plant can be achieved according to the present invention. For 
example, flower- fruit- and seed-specific promoters can be used to regulate the 
expression of the binding protein in these tissues of the plant, especially when 
sudden frosts strike in early sphng and late fall. 

The environmental stress tolerance gene regulated by the 
recombinantly expressed regulatory gene may be native or non-native to the 
plant. Hence, in one embodiment, the plant includes a recombinant copy of a 
regulatory gene which is native to the plant and expresses a native protein 
which functions within the plant to regulate expression of a native 
environmental stress tolerance gene. In this embodiment, the recombinant 
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plant expresses a higher level of the native regulatory gene than the plant 
would othenvise. 

In another embodiment, at least one of the regulatory genes and the 
environmental stress tolerance genes is not native to the plant. For example, 
the regulatory gene can be native and the environmental stress tolerance gene 
is non-native, or the regulatory gene is non-native and the environmental stress 
tolerance gene is native to the plant. 

In yet another embodiment, the plant can include a recombinant copy of 
a regulatory gene which is not native to the plant as well as a recombinant copy 
of one or more environmental stress tolerance genes which also is not native to 
the plant. According to this embodiment, the non-native regulatory gene 
expresses a non-native binding protein which functions within the plant to 
regulate expression of the one or more non-native environmental stress 
tolerance genes. In this regard, it is envisioned that the present invention can 
be used to introduce, change and/or augment the environmental stress 
tolerance of a plant by introducing and causing the expression of environmental 
stress tolerance which the plant does not have in its native form. Accordingly, 
plants from warmer climates can be engineered to include one or more cold 
tolerance genes along with a regulatory gene needed to cause expression of 
the cold tolerance genes in the plant so that the engineered plant can survive 
better in a colder climate. Similarly, a plant can be engineered to include one 
or more dehydration tolerance genes along with a regulatory gene needed to 
cause expression of the dehydration tolerance gene so that the engineered 
plant can grow better in a dryer climate. In this regard, it should be possible to 
take a plant which grows well in a first climate and engineer it to include stress 
tolerance genes and regulatory genes native to a second climate so that the 
plant can grow well in the second climate. 

The present invention also relates to a method for changing or 
enhancing the environmental stress tolerance of a plant. 

In one embodiment, the method includes introducing at least one copy 
of a regulatory gene encoding a binding protein of the present invention into a 
plant; expressing the binding protein encoded by the regulatory gene; and 
using the expressed binding protein to stimulate expression of at least one 
environmental stress tolerance gene through binding to a DNA regulatory 
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sequence. According to this embodiment, the regulatory gene may be non- 
recombinant or recombinant native or non-native to the plant. Similarly, the 
DNA regulatory sequence and the environmental stress tolerance gene may 
each independently be native or non-native to the plant. In one variation of this 
embodiment, the method further includes recombinantly introducing an 
environmental stress tolerance gene into the plant which is regulated by the 
recombinant regulatory gene. 

In another embodiment, the method includes introducing a recombinant 
promoter which regulates expression of a regulatory gene encoding a binding 
protein of the present invention into a plant; expressing the binding protein 
under the control of the recombinant promoter in the plant; and using the 
expressed binding protein to stimulate expression of at least one environmental 
stress tolerance gene through binding to a DNA regulatory sequence. 

According to this embodiment, the regulatory gene, the DNA regulatory 
sequence and the environmental stress tolerance gene may each 
independently be non-recombinant or recombinant native or non-native to the 
plant. Also according to this embodiment, the promoter can be a tissue- 
specific promoter such as a flower-, fruit- and seed-specific promoter. In this 
instance, expressing the binding protein includes selectively expressing the 
binding protein in a particular type of tissue, such as flowers, fruits or seeds of 
the plant. 

In yet another embodiment, the method includes introducing at least 
one recombinant environmental stress tolerance gene into a plant; expressing 
a binding protein; and using the expressed binding protein to stimulate 
expression of the recombinant environmental stress tolerance gene through 
binding to a DNA regulatory sequence. According to this embodiment, the 
gene encoding the regulatory protein, and the DNA regulatory sequence may 
each independently be non-recombinant or recombinant native or non-native to 
the plant. The recombinant environmental stress tolerance gene may be either 
native or non-native to the plant. 
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1. DEFINITIONS 



The term "C-repeat cold and drought regulation element" or "C- 
repeat/DRE" refers to a sequence which includes CCG and functions as a 
binding domain in a plant to regulate expression of one or more environmental 
stress tolerance genes, such as cold or dehydration stress tolerance genes. 

The term "cold stress" refers to a decrease in ambient temperature, 
including a decrease to freezing temperatures, which causes a plant to attempt 
to acclimate itself to the decreased ambient temperature. 

The term "dehydration stress" refers to drought, high salinity and other 
conditions which cause a decrease in cellular water potential in a plant. 

Transformation means the process for changing the genotype of a 
recipient organism by the stable introduction of DNA by whatever means. 

A transgenic plant is a plant containing DNA sequences which were 
introduced by transformation. Horticultural and crop plants particularly benefit 
from the present invention. 

Translation means the process whereby the genetic information In an 
mRNA molecule directs the order of specific amino acids during protein 
synthesis. 

The term "essentially homologous" means that the DNA or protein is 
sufficiently duplicative of that set forth in Figure 2B to produce the same result. 
Such DNA can be used as a probe to isolate DNA's in other plants. 

A promoter is a DNA fragment which causes transcription of genetic 
material. For the purposes described herein, promoter is used to denote DNA 
fragments that permit transcription in plant cells. 

A poly-A addition site is a nucleotide sequence which causes certain 
enzymes to cleave mRNA at a specific site and to add a sequence of adenylic 
acid residues to the 3'-end of the mRNA. 

The phrase "DNA in isolated form" refers to DNA sequence which has 
been at least partially separated from other DNA present in its native state in 
an organism. A cDNA library of genomic DNA is not "DNA in isolated form" 
whereas DNA which has been at least partially purified by gel electrophoresis 
corresponds to "DNA in isolated form". 
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2. CRepeat/DRE Regulatory Elem nts In Plants 

C-repeat cold and drought regulation elements (C-repeat/DRE) are 
sequences which function as a c/s-acting regulatory element that stimulates 
transcription in response to an environmental stress, such as low temperature 
(Yamaguchi-Shinozaki, K.. et al., The Plant Cell 6:251-264 (1994); and Baker, 
S. S., et al., Plant Mol. Biol. 24:701-713 (1994); Jiang, C, et al.. Rant Mol. Biol. 
30:679-684 (1996)) or dehydration stress and high salinity (Yamaguchi- 
Shinozaki. K.. et a!.. The Plant Cell 6:251-264 (1994)). An object of the 
research leading to the present invention was the determination of how a C- 
repeat/DRE stimulates gene expression in response to these environmental 
factors, and whether cold, dehydration and high salinity affect independent or 
overlapping regulatory systems. 

The first step toward determining how a C-repeat/DRE regulation 
element stimulates gene expression was the identification of the C-repeat cold 
and drought regulation element itself. The 5 base pair core sequence, 
CCGAC, has been found to be present once to multiple times in a variety of 
plant cold-regulated promoters in Arabidopsis and Brassica including the 
C0R15a (Baker, S. S., etal. Plant. Mol. Biol. 24:701-713 (1994)); 
COR78/RD29A (Horvath, D. P., et al. Plant Physiol. 103:1047-1053 (1993) and 
Yamaguchi-Shinozaki. K., et al.. The Plant Cell 6:251-264 (1994)); COR6.6 
(Wang, H., et al.. Plant Mol. biol. 28:605-617 (1995)); and KIN1 (Wang. H.. et 
al, Plant Mol. Biol. 28:605-617 (1995)) genes of Arabidopsis and the BN115 
gene of Brassica napus (White. T. C. et al. Plant Physiol. 106:917-928 (1994)). 
As shown in the examples herein, core sequence CCGAC was used to identify 
proteins encoded by genes within the Arabidopsis genome which bind to this 
core sequence. 

Applicants believe that the CCGAC core sequence is a member of 
family of core sequences having the common subsequence CCG. The binding 
of CBF1 to the C-repeat/DRE involves the AP2 domain. In this regard, it is 
germane to note that the tobacco ethylene response element, AGCCGCC, 
closely resembles the C-repeat/DRE sequences present in the promoters of 
the Arabidopsis genes COR15a, GGCCGAC, and COR78/RD29A, TACCGAC. 
While the specific teachings in the present invention used only a DNA 
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regulatory sequence which includes a CCGAC subsequence as the C- 
repeat/DRE core regulatory sequence, Applicants believe that other C- 
repeat/DRE regulatory sequences exist which belong to a broader CCG family 
of regulatory sequences. By screening plant genomes according to the 
methodology taught herein using other members of the CCG family, additional 
regulatory sequences as well as the binding proteins which bind to these 
regulatory sequences can be identified. For example, plants which are known 
to exhibit a fonn of environmental stress tolerance can be screened according 
to the blue colony assay and other screening methodologies used in the 
present invention with other members of the CCG family in order to identify 
other binding proteins and their gene sequences. Examples of other members 
of the CCG family include, but are not limited to, environmental stress response 
regulatory elements which include one of the following sequences: CCGAA, 
CCGAT, CCGAC. CCGAG, CCGTA, CCGTT, CCGTC, CC6TG, CCGCA, 
CCGCT. CCGCG. CCGCC. CCGGA, CCGGT, CCGGC, CCGGG, AACCG. 
ATCCG, ACCCG, AGCCG. TACCG, TTCCG, TCCCG, TGCCG, CACCG, 
CTCCG, CGCCG, CCCCG, GACCG, GTCCG, GCCCG, GGCCG, ACCGA, 
ACCGT, ACCGC, ACCGG, TCCGA, TCCGT, TCCGC. TCCGG, CCCGA, 
CCCGT, CCCGC, CCCGG. GCCGA, GCCGT, GCCGC, and GCCGG. 

Applicants also believe that other families of environmental stress 
tolerance DNA regulatory sequences, other than the CCG family may exist. 
The methodologies of the present invention may be used once such other 
families are identified in order to identify specific environmental stress tolerance 
DNA regulatory sequences and associated binding proteins. 

3. identification Of Environmental Stress Tolerance Regulatory 
Gene Sequences Using Target Regulatory Sequence 

It is possible to take a cDNA library of at least a portion of a plant 
genome and screen the cDNA library for the presence of regulatory gene 
sequences which encode binding proteins capable of binding to a target 
regulatory sequence. As used here, a target DNA regulatory sequence refers 
to a sequence to which a binding protein for one or more environmental stress 
tolerance genes binds. Permutations of the CCG and CCGAC families of DNA 
regulatory sequences represent examples of target DNA regulatory sequences. 
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As detailed in Example 1 herein, this was the approach was used to identify 
CBF1, a sequence which encodes a binding protein for the Arabadopsis DNA 
regulatory sequence, from an Arabadopsis cDNA library . 

First a target regulatory sequence is selected. The target regulatory 
sequence is preferably native to the plant from which the cDNA library being 
screened is derived. 

Once a target regulatory sequence is selected, the target regulatory 
sequence is fused to a reporter gene and introduced into a microorganism. 
Expression of the reporter gene can be activated by a protein which includes a 
binding domain capable of binding to the target DNA regulatory sequence and 
an activation domain capable of activating transcription. 

Sequences from a cDNA library of at least a portion of a plant genome 
are then fused to a sequence which encodes a functional activation domain in 
the microorganism. The fused sequences are then introduced into the 
microorganism. It is possible that the sequence from the cDNA library may 
already encode a functional activation domain, for example as described herein 
in Example 1. 

Microorganisms which express the reporter gene are then selected 
Since only those microorganisms which express a fusion protein which 
includes a binding domain for the target DNA regulatory sequence and an 
activation domain will stimulate expression of the reporter gene, expression of 
the reporter gene indicates expression of such a fusion protein. 

The gene sequence from the cDNA library introduced into the 
microorganism which stimulates expression of the reporter gene is then 
identified. 

According to the above method, the target DNA regulatory sequence 
preferably includes the subsequence CCG and more preferably includes the 
subsequence CCGAC. 

The "one-hybrid" strategy described in Li, J. J. and I. Herskowitz 
Science 262:1870-1874 (1993) and used in Example 1 to screen Arabidopsis 
CDNA is an example of this method. This method can be used to screen any 
plant species for cDNAs that encode a target regulatory sequence, such as a 
C-repeat/DRE regulatory sequence. According to the "one hybrid" strategy 
yeast strains are constructed that contain a /acZ reporter gene with either wild- 
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type or mutant versions of target regulatory sequences in place of the normal 
UAS (upstream activator sequence) of the GAL1 promoter. Yeast strains 
carrying these reporter constructs produce low levels of Rbeta-galactosidase 
and form white colonies on filters containing X-gal. Reporter strains carrying 
wild-type target regulatory sequences are transformed with a cDNA expression 
library that contains random cDNA inserts fused to the acidic activator domain 
of the yeast GAL4 transcription factor "GAL4-ACr'. Recombinant plasmids in 
the expression library that contain a cDNA insert encoding a C-repeat/DRE 
binding domain fused to GAL4-ACT will express fusion proteins which bind 
upstream of the lacZ reporter genes carrying the wild-type target regulatory 
sequence, activate transcription of the lacZ gene, and result in yeast forming 
blue colonies on X-gal-treated filters. Alternatively, the sequence from the 
cDNA library introduced into the microorganism may. as was observed in 
Example 1 . include a sequence encoding an activator domain and thus not 
utilize the acidic activator domain of the yeast GAL4 transcription factor "GAL4- 
ACT". 

Recombinant plasmids from such "blue yeast" are then isolated and 
transformed back into reporter strains that contain either a wild-type or mutant 
version of target regulatory sequence fused to the lacZ gene. The plasmids 
that are desired are those that turn the former strains blue, but not the later, 
indicating that the cloned DNA binding domain is specific for the target 
regulatory sequence. 

Based on presence of an AP2 binding domain in CBF1, CBF2 and 
CBF3. Applicants believe that an AP2 binding domain is present in a significant 
number of the environmental stress tolerance regulatory binding proteins 
Accordingly, it is believed that the specificity of the above method for screening 
for gene sequences encoding a regulatory binding protein can optionally be 
improved by first selecting cDNA from a plant genome library which includes a 
potential AP2 domain site. This can be routinely done by selecting probes for 
selecting sequences in the library which include potential AP2 domain 
sequences. 
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4. Screening For Expression Of Environmental 
Stress Tol ranee Regulatory Prot in 

Once one or more microorganisms are selected which are believed to 
express a protein capable of binding to the target regulatory element and 
activate expression of the reporter gene, further analysis can be performed to 
identify and isolate full length cDNAs; i.e. cDNAs that encode the entire protein 
that binds to the target regulatory sequence. The coding sequence for the 
protein can then cloned into an expression vector, such as the pET bacterial 
expression vectors (Novagen), and used to produce the protein at high levels. 
The protein can then be analyzed by gel retardation experiments (See Example 
IF) to confirm that it binds specifically to the target regulatory sequence. 

Potential sequences can be further screened using known regulatory 
gene sequences, such as CBF1 , CB2, and CBF3, or the presence of an AP2 
domain which is believed to be common to a significant class of this genes. 
Once identified, particular sequences can be transformed into yeast to test for 
activation of expression of a reporter gene, for example as described in 
Example 1E. 



5. Screening For Binding To Target Regulatory Sequence 

Once a regulatory gene sequence is identified, the sequence can be 
introduced into a microorganism in order to express the protein encoded by the 
sequence. A gel shift assay, such as the one described in Example 1F, can 
then be used to test for in vitro binding of the expressed protein to the target 
DNA regulatory sequence. 

Mutagenesis of the target DNA regulatory sequence can also be 
performed in order to evaluate the binding selectivity of the expressed protein. 
It is prefen-ed that the expressed protein selectively bind to the target DNA 
regulatory sequence over related sequences with one or more base differences 
from the target DNA regulatory sequence. For example. Figure 5 is a 
photograph of a gel from a shift assay in which CBF1 was shown to selectively 
bind to the wild-type C-repeat/DRE CCGAC. 
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6. Altering The Environmental Stress Tol ranc of a Plant. 

The present invention also provides a method for recombinant 
engineered plants with a new or altered response to one or more 
5 environmental stresses. 

According to one embodiment, a copy of a gene native to a plant which 
encodes a binding protein according to the present invention is recombinantly 
introduced into the plant such that the plant expresses a recombinant binding 
protein encoded by the recombinant copy of the gene. 

According to another embodiment, a non-native gene which encodes a 
binding protein according to the present invention is recombinantly Introduced 
into a plant such that the plant expresses a recombinant binding protein 
encoded by the recombinant non-native gene. 

According to yet another embodiment, a native or non-native DNA 
15 regulatory sequence is recombinantly introduced into a plant such that the 
recombinant DNA regulatory sequence regulates the expression of one or 
more environmental stress tolerance genes in the plant. The plant includes a 
gene which encodes a binding protein capable of binding to the recombinant 
DNA regulatory sequence. 
20 In yet another embodiment, a native or non-native promoter is 

recombinantly introduced into a plant such that the recombinant promoter 
regulates the expression of a binding protein which binds to a DNA regulatory 
sequence. 

According to each of the above embodiments, unless othenwise 
25 specified, the gene encoding the binding protein, the promoter promoting the 
expression of the binding protein, the DNA regulatory sequence, and the 
environmental stress tolerance genes may be non-recombinant or recombinant 
sequences. The recombinant sequences may be native to the plant or may be 
non-native to the plant. All the above pemnutations are intended to fall within 
30 the scope of the present invention. 

As an example, many plants increase in freezing tolerance in response 
to low non-freezing temperatures, a process known as cold acclimation. A 
large number of biochemical changes occur during cold acclimation including 
the activation of COR (COId Regulated) genes. These genes, which are also 
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expressed in response to dehydration (e.g., drought and high salinity), are 
thought to help protect plant cells against the potentially deleterious effects of 
dehydration associated with freezing, drought and high salinity stress. Indeed, 
expression of the COR15a gene in plants grown at normal temperatures (22 
"C) enhances the freezing tolerance of chloroplasts. 

By manipulating the expression of COR genes, the stress tolerance of 
crop and horticultural plants could be improved, e.g., engineer broader climate 
ranges; target stress resistance to stress-sensitive parts of plants; render 
plants stress-resistant when a stress condition (frost and drought) is imminent. 
To bring about these effects, however, the expression of the COR genes must 
be manipulated. The gene, CBF1, that encodes the transcription factor that 
binds to the C-repeat/DRE regulatory element present in the promoters of all 
COR genes described to date has been isolated. CBF1 in yeast activates 
expression of reporter genes that have been fused to the C-repeat/DRE 
element. Further, expression of CBF1 in plants has been shown to activate the 
expression of COR genes. 

By introducing modified versions of sequences encoding regulatory 
binding proteins, such as CBF1, into plants, the expression of COR genes can 
be modified, and thereby enhance the freezing and dehydration tolerance of 
plants. 

In each of the above embodiments, expression of the recombinant copy 
of the regulatory gene may be under the control of a promoter. The promoter 
may be recombinant or non-recombinant. In the case of recombinant 
promoters, the promoter may be native or non-native to the plant. 

When a recombinant promoter is used, the promoter can be selected to 
cause expression of the binding protein in a manner which is different than how 
the binding protein is expressed by the plant in its native state. For example, 
the promoter may increase the level at which the binding protein is expressed, 
express the binding protein without being induced by an environmental stress 
and/or express the binding protein in response to a different form or degree of 
environmental stress than would othenvise be needed to induce expression of 
the binding protein. The promoter may also be inducible by an exogenous 
agent. For example, a strong constitutive promoter could be used to cause 
increased levels of COR gene expression in both non-stress and stressed 
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plants which in turn, results in enhanced freezing and dehydration tolerance. 
Examples of such strong constitutive promoters-include but are not limited to 
the nopaline synthase (NOS) and octopine synthase (OCS) promoters, the 
cauliflower mosaic virus (CaMV) 19S and 35S (Odell et a!.. Nature 313: 810- 
5 812 (1985)) promoters or the enhanced CaMV 35S promoters (Kay et al., 
Science 236; 1299-1302 (1987)). 

A tissue-specific promoter could also be used to alter COR gene 
expression in tissues that are highly sensitive to stress such as embryos in the 
seed, flower and fruit, thereby enhancing the stress tolerance of these tissues. 

10 Embryo -active promoters include promoters such as the B. napus napin 

promoter (US Patent 5,420,034), the soybean 7S promoter, the Arabidopsis 
12S globulin (cruifehn) promoter (Pang, et al. Plant Molecular Biology 11:805- 
820 (1988)), or the maize globulin promoter (Khz et al. Plant Physiol. 91:636 
(1989): US Patent No. 5,773,691) for use in cereal embryos. 

^5 Promoters useful in expressing foreign genes in fruits (Cordes et a!., 

Plant Cell 1:1025-1034 (1989); Deikman and Fischer, EMBO J. 7: 3315-3320 
(1988); Delia Penna et al., Proc. Natl. Acad. Sci. USA 83: 6420-6424 (1986)) 
could also be used to alter COR gene expression in fruits. Examples include, 
but are not limited to, the fruit-specific promoter that was used to express an 

20 ADP glucose pyrophosphorylase gene in order to increase the solid content of 
tomato fruit (Kishore, PCT App. WO 91/19806), the promoter from the 2A11 
genomic clone (Pear, et al. Plant Mol. Biol. 13: 639-651 (1989); US Patent No. 
4,943,674) that can be used to control expression of ADP glucose 
pyrophosphorylase in tomato fruit, the E4 and ESpromoters (Deikman, et a!., 

25 EMBO J. 7: 3315-3320 (1988); US Patent No. 5,545,815), the promoter for 
polygalacturonase , the raspberry fruit promoter described in US Patent No. 
5,783,393, fruit-active promoters such as the E8 promoter from tomatoes, and 
citrus fruit-specific or fruit-active promoters that can be isolated from the 
CitMT45 cDNA (Moriguchi et al., Gene 12: 221-227 (1998)) and pSPS2 

30 (Komatsu et al., Mol, Gen. Genet. 252:346-351 (1996)). 

Promoters known to be expressed in developing flowers, particularly in 
the carpel or pistil tissues, could also be used to alter COR gene expression in 
flowers. Examples of such promoters include the DefH9 promoter that was 
used to make parthenocarpic plants and is expressed in the petals, stamens, 
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carpels and developing ovules (Rotino et al. Nat Biotechnol 15:1398- 
401 (1997)), the SK2-promoter that was shown to express in the pistil (Ficker et 
aL, Plant Mol Biol 35: 425-31 (1997)). and the Agamous promoter and 
intergenic region that was used to express in early and late flowers, and in the 
5 inner two whorls of flowers (Sieburth and Meyerowitz, Plant Cell 9: 355-65 
(1997)). 

Other tissue-specific promoters that could be used to alter COR gene 
expression in specific tissues include, but are not limited to, seed-specific 
promoters for the B. napus napin gene (U.S. Patent No. 5,420,034), the 

10 soybean 7S promoter, the Arabidopsis 12S globulin (cruiferin) promoter (Pang, 
et aL Plant Molecular Biology 11: 805-820 (1988)), the maize 27kd zein 
promoter, the rice glutelin 1 promoter and the phytohemaglutinin gene, tuber- 
specific promoters such as the patatin promoter, and the promoter for the small 
subunit of ribuloe-1,5-bis-phosphate carboxylase (ssRUBISCO) whose 

15 expression is activated in photosynthetic tissues such as leaves. It should be 
noted that other promoters that are known or found to cause specific 
expression in flowers, seeds or fruits of plants or express in these or other 
tissues of the plants to cause transcription in plant cells could also be used to 
alter COR gene expression in the specific tissues according to the present 

20 invention. 

Altering COR gene expression in specific tissues of plants such as 
flowers, fruits or seeds may increase frost tolerance of these tissue and prolong 
the growing seasons for plants. Examples of the specific tissues of plants 
according to the present invention include, but not limited to, frost-resistant 

25 flowers in strawberries, peaches, blueberries, cherries, apricots, daffodils, 
apples, and plums; frost-resistant canola or rape seeds for preventing the 
formation of green seeds at harvest; frost-resistant bariey seeds for maintaining 
malting ability; and frost-resistant fruits including true benries such as tomato, 
grape, blueberry, cranberry, currant, and eggplant; stone fruits (drupes) such 

30 as cherry, plum, apricot, peach, nectarine and avocado; and compound fruits 
(druplets) such as raspberry and blackberry; in citrus fruits such as oranges, 
lemons, grapefruit and tangerines; and in melons such watermelon, 
cantaloupe, honeydew, cucumber, and squash. 



40 



wo 99/38977 



PCT/US99/01895 



In addition, the COR gene expression can also be altered in specific 
tissues of the following plants according to the present invention: cauliflower 
(Brassica oleracea), artichoke (Cynara scolymus), fruits such as apple (Malus, 
e.g. domesticus), banana (Musa, e.g. acuminata), beaies (such as the currant, 
5 Ribes, e.g. rubrum). cherries (such as the sweet cherry, Prunus, e.g. avium), 
cucumber (Cucumis. e.g. satlvus), grape (Vitis, e.g. vinifera), lemon (Citrus 
limon), melon (Cucumis meio), nuts (such as the walnut, Juglans, e.g. regia; 
peanut, Arachis hypogeae). orange (Citrus, e.g. maxima), peach (Prunus, e.g. 
perstca), pear (Pyra, e.g. communis), pepper (Solanum, e.g. capsicum), plum 

10 (Prunus, e.g. domestica), strawberry (Fragaria, e.g. moschata), tomato 

(Lycopersicon, e.g. esculentum), leafs, such as alfalfa (Medicago, e.g. sativa), 
cabbages (such as Brassica oleracea), endive (Cichoreum, e.g. endivia), leek 
(Allium, e.g. porrum), lettuce (Lactuca, e.g. sativa), spinach (Spinacia e.g. 
oleraceae), tobacco (Nicotiana, e.g. tabacum), roots, such as arrowroot 

15 (Maranta, e.g. arundinacea). beet (Beta, e.g. vulgaris), carrot (Daucus, e.g. 

carota), cassava (Manihot, e.g. esculenta), turnip (Brassica, e.g. rapa), radish 
(Raphanus, e.g. sativus). yam (Dioscorea, e.g. esculenta), sweet potato 
(Ipomoea batatas) and seeds, such as bean (Phaseolus, e.g. vulgaris), pea 
(Pisum, e.g. sativum), soybean (Giycin, e.g. max), wheat (Triticum, e.g. 

20 aestivum), barley (Hordeum, e.g. vulgare), corn (Zea, e.g. mays), rice (Oryza, 
e.g. sativa), tubers such as kohlrabi (Brassica, e.g. oleraceae), and potato 
(Solanum, e.g. tuberosum). 

Alternatively, an inducible promoter may be used to control the 
25 expression of the regulatory binding protein, such as CBF1, in plants. Because, 
in some cases, constitutive expression of higher levels of CBF proteins may 
have some detrimental effects on plant growth and development, the controlled 
expression of CBF genes is especially advantageous. For example, a 
promoter could be used to induce the expression of CBF proteins only at a 
30 proper time, such as prior to a frost that may occur eariier or later in the 

growing season of a plant, thereby prolonging the growing season of a crop 
and increasing the productivity of the land. This may be accomplished by 
applying an exogenous inducer by a grower whenever desired. Alternatively, a 
promoter could be used which turns on at a temperature that is warmer than 
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the temperature at which the plant normally exhibits cold tolerance. This would 
enable the cold tolerance thermostat of a plant to be altered. Similarly, a 
promoter can be used which turns on at a dehydration condition that is wetter 
than the dehydration condition at which the plant normally exhibits dehydration 
5 tolerance. This would enable the level at which a plant responds to 
dehydration to be altered. 

Promoters which are known or are found to cause inducible 
transcription of the DNA into mRNA in plant cells can be used in the present 
invention. Such promoters may be obtained from a variety of sources such as 

10 plant and inducible microbial sources, and may be activated by a variety of 

exogenous stimuli, such as cold, heat, dehydration, pathogenesis and chemical 
treatment. The particular promoter selected is preferably capable of causing 
sufficient expression of the regulatory binding protein, such as CBF1, to 
enhance plant tolerance to environmental stresses. Examples of promoters 

15 which may be used include, but are not limited to, the promoter for the DRE (C- 
repeat) binding protein gene dreb2a (Liu, et al. Plant Cell 10: 1391-1406 
(1998)) that is activated by dehydration and high-salt stress, the promoter for 
delta 1-pyrroline-5-carboxylate synthetase (P5CS) whose expression is 
induced by dehydration, high salt and treatment with plant hormone abscisic 

20 acid (ABA) (Yoshiba, et al., Plant J. 7 751-760 (1987)), the promoters for the 
rd22 gene from Arabidopsis whose transcription is induced under by salt 
stress, water deficit and endogenous ABA (Yamaguchi-Shinozaki and 
Shinozaki, Mol Gen Genet 238 17-25 (1993)), the promoter for the rd29b gene 
(Yamaguchi-Shinizaki and Shinozaki, Plant Physiol., 101 1119-1120 (1993)) 

25 whose expression is induced by desiccation, salt stress and exogenous ABA 
treatment (Ishitani et aL, Plant Ceil 10 1151-1161 (1998)), the promoter for the 
rab18 gene from Arabidopsis whose transcripts accumulate in plants exposed 
to water deficit or exogenous ABA treatment, and the promoter for the 
pathogenesis-related protein la (PR-la) gene whose expression is induced by 

30 pathogenesis organisms or by chemicals such as salicylic acid and polyacrylic 
acid. 

It should be noted that the promoters described above may be further 
modified to alter their expression characteristics. For example, the 
drought/ABA inducible promoter for the rab18 gene may be incorporated into 
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seed-specific promoters such that the rab18 promoter is drought/ABA inducible 
only when developing seeds. Similarly, any number of chimeric promoters can 
be created by ligating a DNA fragment sufficient to confer environmental stress 
inducibility from the promoters described above to constitute promoters with 
other specificities such as tissue-specific promoters, developmentally regulated 
promoters, light-regulated promoters, hormone-responsive promoters, etc. This 
should result in the creation of chimeric promoters capable of being used to 
cause expression of the regulatory binding proteins in any plant tissue or 
combination of plant tissues. Expression can also be made to occur either at a 
specific time during a plant's life cycle or throughout the plant's life cycle. 

According to the present invention, an expression vector can be 
constructed to express the regulatory binding protein in the transformed plants 
to enhance their tolerance to environmental stresses. In one embodiment, the 
DNA construct may contain (1) an inducible promoter that activates expression 
of the regulatory binding protein in response to environmental stimuli; (2) a 
sequence encoding the regulatory binding protein; and (3) a 3' non-translated 
region which enables 3' transcriptional termination and polyadenylation of the 
mRNA transcript. The inducible promoter may be any one of the natural or 
recombinant promoters described above. The gene encoding the regulatory 
binding protein can be any one disclosed in the present invention. The 3' region 
downstream from this gene should be capable of providing a polyadenylation 
signal and other regulatory sequences that may be required for the proper 
expression and processing of a mRNA may be operably linked to the 3'end of a 
structural gene to accomplish the invention. This may include the native 3' end 
of the homologous gene form which the regulatory binding protein and/or the 
inducible promoter is derived, the 3' end from a heterologous gene encoding 
the same protein from other species, the 3' end from viral genes such as the 3' 
end of the 35S or the 1 9S cauliflower mosaic virus transcripts, the 3' end of the 
opine synthesis genes of Agrobacterium tumefaciens. or the 3' end sequences 
from any source such that the sequence employed provides the necessary 
regulatory information within its nucleic acid sequence to result in the proper 
expression of the promoter/coding region combination to which the 3' end 
sequence is operably linked. 
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A variety of expression vectors can be used to transfer the gene 
encoding the regulatory binding protein as well as the desired promoter into the 
plant. Examples include but not limited to those derived from a Ti plasmid of 
Agrobacterium tumefaciens, as well as those disclosed by Herrera-Estrella, L.. 
et al., Nature 303: 209(1983), Bevan, M., Nucl. Acids Res. 12: 8711-8721 
(1984), Klee, H. J., Bion"echnology 3; 637-642 (1985), and EPO Publication 
120,516 (Schilperoort et al.) for dicotyledonous plants. Alternatively, non-Ti 
vectors can be used to transfer the DNA constructs of this invention into 
monotyledonous plants and plant cells by using free DNA delivery techniques. 
Such methods may involve, for example, the use of liposomes, electroporation, 
microprojectile bombardment, silicon carbide wiskers, viruses and pollen. By 
using these methods transgenic plants such as wheat, rice (Christou, P., 
Bio/Technology 9: 957-962 (1991)) and corn (Gordon-Kamm, W., Plant Cell 2: 
603-618 (1990)) are produced. An immature embryo can also be a good target 
tissue for monocots for direct DNA delivery techniques by using the particle 
gun (Weeks. T. et al.. Plant Physiol. 102: 1077-1084 (1993); Vasil, V.. 
Bion-echnology 10: 667-674 (1993); Wan, Y. and Lemeaux, P., Plant Physiol. 
104: 37-48 (1994), and for Agrobacterium-mediated DNA transfer (Hiei et al., 
Plant J. 6: 271-282 (1994); Rashid et al.. Plant Cell Rep. 15: 727-730 (1996); 
Dong, J., et al., Mol. Breeding 2: 267-276 (1996); Aldemita, R. and Hodges, T.. 
Planta 199: 612-617 (1996); Ishida et al.. Nature Biotech. 14: 745-750 (1996)). 

In one embodiment, the plasmid vector pMEN020 is preferred, which is 
derived from a Ti plasmid pMON 10098 which is the type of binary vector 
described in U.S. Patent Nos. 5,773,701 and 5,773,696. PMEN20 differs from 
PMON10098 by the substitution of a Kpnl, Sail. Saci, Sacll, NotI, and Xbal 
restriction sites between the ECaMV 353 promoter and the E9 3' region. 
Plasmid pMON 10098 contains the following DNA segments. Starting at the 
bottom of the plasmid map is the origin of bacterial replication for maintenance 
in E. coli (ori-322). Moving in a counter-clockwise direction on the map, next is 
ori-V, which is the vegetative origin of replication (Stalker et al. Mol. Gen. 
Genet. 181:8-12 (1981)). Next is the left border of the T-DNA. Next is the 
chimeric gene used as the selectable marker. The chimera includes the 0.35 
kilobase (kb) of the cauliflower mosaic virus 35S promoter (P-35S) (Odell et al. 
(1985) Nature 313:810-812). , a 0.84 kb neomycin phosphotransferase type II 
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gene (KAN) and a 0.25 kb 3* non-translated region of the nopaline synthase 
gene (NOS 3') (Fraley et al. (1983) Proc. Natl. Acad. Sci. USA 80:1803-1807). 
The next sequence contains the enhanced CaMV 35S promoter and E9 3* 
region gene cassette and restriction sites for inserting genes such as the 
5 coding region of CBF genes. This chimeric gene cassette ends with the 0.65 kb 
of the E9 3' region from the pea small subunit of RUBISCO gene (U.S. Patent 
No. 5 773,701). Next is the right border of the T-DNA. Next is the 0.93 kb 
fragment isolated from transposon Tn7 that encodes the bacterial 
spectinomycin/streptomycin resistance (Spc/Str), which is a determinant for 

10 selection in E. coli and Agrobacterium tumefaciens (Fling et al., NucL Acids 
Res, 13:7095-7106 (1985)), 

The pMEN020 plasmid construct is a binary cloning vector that contains 
both E. coli and Agrobacterium tumefaciens origins of DNA replication but no 
wr genes encoding proteins essential for the transfer and integration of the 

15 target gene inserted in the T-DNA region. PMEN020 requires the trfA gene 
product to replicate in Agrobacterium. The strain of Agrobacterium containing 
this trfA gene is called the ABI strain and is described below and in U.S. Patent 
Nos. 5,773,701 and 5,773,696. This cloning vector serves as an £ coli- 
Agrobacterium tumefaciens shuttle vector. All of the cloning steps are carried 

20 out in E. coli before the vector is introduced into ABI strain of Agribacterium 
tumefaciens. 

in another embodiment, pMENOSO is preferred, which is derived from 
pMEN020 by replacing the Nptll kanamycin resistance gene with the Bar gene 
(US Patent No. 5,646,024) by using the same cloning method described above 
25 for PMEN020. 

The recipient ABI strain of Agribacterium carries a modified defective Ti 
plasmid that serves as a helper plasmid containing a complete set of wr genes 
but lacks portions or all of the T-DNA region, ABI is the A208 Agrobacterium 
tumefaciens strain carrying the disarmed pTiC58 plasmid pMP90RK (Koncz et 
30 al. Mol. Gen. Genet. 204:383-396 (1986)). The disarmed Ti plasmid provides 
the trfA gene functions that are required for autonomous replication of the 
binary vectors after transfer into the ABI strain. When plant tissue is Incubated 
with the ABI::binary vector strains, the vectors are transfen-ed to the plant cells 
by the vir functions encoded by the disarmed pMP90RK Ti plasmid. After the 
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introduction of the binary vector into the recipient Agribactehum, the vir gene 
products mobilize the T-DNA region of the pMEN020 plasmid to insert the 
target gene, e.g. the gene encoding the regulatory binding protein, into the 
plant chromosomal DNA, thus transfonming the cell. 
5 It should be noted that methods for transforming a wide variety of 

different dicots and obtaining transgenic plants are well documented in the 
literature (See Gasser and Fraley Science 244:1293 (1989); Fisk and 
Dandekar, Scientia Horticulturae 55: 5-36 (1993); Christou Agro Food Industry 
Hi Tech March/April: p. 17 (1994), and the references cited therein). 

10 Methods for producing transgenic plants among the monocots are also 

available. Successful transformation and plant regeneration have been 
achieved in asparagus (Asparagus officinalis] Bytebier et aL Proc. Natl. Acad. 
Sci. USA 84:5345 (1987)); barley (Hordeum vulgare\ \Nat\ and Lemaux, Plant 
Physiol 104:37 (1994;); maize (Zea mays; Gordon-Kamm et a!., Plant Cell 

15 2:603 (1990); Fromm et al. Bio/Technology 8:833 (1990); Koziel et al. 
Bio/Technology 11: 194 (1993)); oats {Avena sativa, Somers et al. 
Bio/Technology 10: 1589 (1992)); orchardgrass (Dactylis glomerata\ Horn et 
al. Plant Cell Rep. 7: 469 (1988;); rice {Oryza saf/Va, including indica and 
japonica varieties; Toriyama et al. Bio/Technology 6:10 (1988); Zhang et al. 

20 Plant Cell Rep. 7: 379 (1988); Luo and Wu Plant Mol. Biol. Rep. 6:165 (1988;; 
Zhang and Wu, Theor. Appl. Genet. 76: 835 (1988); Christou et al. 
Bio/Technology 9: 957 (1991); rye (Secale cereale\ De la Pena et al. Nature 
325: 274 (1987)); sorghum (Sorghum bicolon Cassas et al, Proc, Natl. Acad. 
Sci. USA 90:11212 (1993;); sugar cane (Saccharum spp.; Bower and Birch 

25 Plant J. 2: 409 (1992)); tall fescue (Festuca arundinacea] Wang et al. 

Bio/Technology 10:691 (1992)); turfgrass (Agrostis palustris\ Zhong et al. Plant 
Cell Rep. 13:1 (1993)); wheat (Triticum aestivum: Vasil et al. Bio/Technology 
10: 667 (1992); Troy Weeks et al. Plant Physiol. 102:1077 (1993); Becker et 
al. Plant J. 5:299(1994)). 

30 After transfonnation of cells or protoplasts, the choice of methods for 

regenerating fertile plants is not particularly important. Suitable protocols are 
available for Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (Carrot, 
celery, parsnip), Cruciferae (cabbage, radish, rapeseed, broccoli, etc.), 
Curcurbitaceae (melons and cucumber), Gramineae (wheat, corn, rice, barley, 
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millet, etc.), Solanaceae (potato, tomato, tobacco, peppers, etc.), and various 
other crops See protocols described in Ammirato et al. (1984) Handbook of 
Plant Cell Culture -Crop Species. Macmillan Pubi. Co. Shimamoto et al. 
Nature 338:274-276 (1989); Fromm et al.. Bio/Technology 8:833-839 (1990); 

5 Vasi! et al. Bio/Technology 8:429-434 (1990). 

It is envisioned that the present invention can be used to introduce, 
change and/or augment the environmental stress tolerance of a plant by 
introducing and causing the expression of environmental stress tolerance in a 
manner which the plant does not exhibit in its native form. For example, by 

0 using different promoters in combination with recombinant regulatory genes, 
native environmental stress tolerance genes can be expressed independent of 
environmental stress, made responsive to different levels or types of 
environmental stress, or rendered inducible independent of an environmental 
stress. Further, selection of the promoter can also be used to determine what 

5 tissues in the plant express the binding protein as well as when the expression 
occurs in the plant's lifecycie. By selecting a promoter which regulates in what 
tissues and when in a plant's life the promoter functions to regulate expression 
of the binding protein, in combination with the selecting how that promoter 
regulates expression (level of expression and/or type of environmental or 

0 chemical induction), an incredible range of control over the environmental 
stress responses of a plant can be achieved using the present invention. 

By recombinantly introducing a native environmental stress tolerance 
gene into a plant in combination with a recombinant regulatory gene under the 
control of an inducible promoter, a plant can be engineered which includes its 

5 native environmental stress tolerance as well as inducible environmental stress 
tolerance. This might be useful for inducing a cold stress tolerance reaction in 
anticipation of a frost. 

By recombinantly introducing a non-native environmental stress 
tolerance gene into a plant in combination with a recombinant regulatory gene, 

D a plant can be engineered which includes environmental stress tolerance 

properties that the plant would not othenA^ise have, in this regard, plants from 
warmer climates can be engineered to include one or more cold tolerance 
genes along with a regulatory gene needed to cause expression of the cold 
tolerance genes in the plant so that the engineered plant can survive better in a 
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colder climate. Similarly, a plant can be engineered to include one or more 
dehydration tolerance genes along with a regulatory gene needed to cause 
expression of the dehydration tolerance gene so that the engineered plant can 
grow better in a dryer climate. In this regard, it should be possible to take a 
plant which grows well in a first climate and engineer it to include stress 
tolerance genes and regulatory genes native to a second climate so that the 
plant can grow well in the second climate. 

By modifying the promoter controlling the expression of the gene 
encoding a binding protein which regulates the expression of environmental 
stress tolerance genes, the operation of native, non-recombinant environmental 
stress tolerance genes and regulatory genes can be changed. For example, 
the conditions under which the stress tolerance genes are expressed can be 
changed. Expression can also be rendered inducible by an exogenous agent. 

7. Methods For Detecting Stress Tolerance 
Regulatory Gene Homologs. 

Once one DNA sequence encoding an environmental stress tolerance 
regulatory binding protein has been identified, several methods are available 
for using that sequence and knowledge about the protein it encodes to identify 
homologs of that sequence from the same plant or different plant species. For 
example, let us assume that a cDNA encoding a first target binding domain has 
been isolated from plant species "A." The DNA sequence encoding the first 
target DNA regulatory sequence could be radiolabeled and used to screen 
cDNA libraries of plant species "A." or any other plant species, for DNA inserts 
that encode proteins related to the first target DNA regulatory sequence. This 
could be done by screening colony or phage "lifts" using either high (Tm of 
about -10 °C) or low (Tm of about -30 °C or lower) stringency DNA 
hybridization conditions (Sambrook. J. et al. Molecular Cloning. A Laboratory 
Manual Cold Spring Harbor Laboratory Press, Cold Spring Harbor. NY., 2nd 
Ed. (1989)). CDNA inserts that hybridize with the first target DNA regulatory 
sequence could be sequenced and compared to the original first target DNA 
regulatory sequence. If the insert is confimied to encode a polypeptide similar 
to the first target DNA regulatory sequence, the insert could be cloned into an 
expression vector to produce the encoded protein. The protein would then be 
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analyzed by gel retardation experiments to confirm that it binds specifically to 
the first target DNA regulatory sequence. 

It is recognized that not all proteins that bind to a first target DNA 
regulatory sequence will be transcriptional activators. However, a number of 
routine tests may be performed in order to determine whether a particular 
protein is in fact a transcriptional activator. One test involves expressing the 
protein in yeast strains which contain the target DNA regulatory sequence 
fused to the lacZ reporter gene, as described above. If the protein is a 
transcriptional activator, it should activate expression of the reporter gene and 
result in blue colonies. 

Another test is a plant transient assay. In this case, a reporter gene, 
such as GUS, carrying the target DNA regulatory sequence as an upstream 
activator is introduced into plant cells (e.g. by particle bombardment) with or 
without a the putative transcriptional activator under control of a constitutive 
promoter. If the protein is an activator, it will stimulate expression of the 
reporter (this may be further enhanced if the plant material is placed at low 
temperature or is subjected to water stress as the C-repeat/DRE is responsive 
to low temperature and dehydration). 

Significantly, once a target DNA regulatory sequence is identified, the 
sequence can be fused to any potential activator or repressor sequence to 
modify expression of plant genes that carry the target regulatory sequence as a 
control element. That is, the DNA regulatory sequence can be used to target 
■managed" expression of the battery of environmental stress tolerance related 
genes in a given plant species. 

It is possible that the target DNA regulatory sequence of the regulatory 
element that imparts environmental stress tolerance related gene expression in 
plant species "A" might be slightly different from the analogous target DNA 
regulatory element that imparts environmental stress tolerance in species "B." 
Thus, optimal regulation of the battery of environmental stress tolerance related 
genes in a given species may require the use of the regulatory binding proteins 
from that or a closely related plant species. Knowledge of gene sequences 
which encode for proteins which bind to the DNA regulatory sequence of the 
regulatory element, in combination with knowledge of the DNA regulatory 
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sequence, greatly simplify the identification of sequences encoding binding 
proteins native to the plant species. 

With the advent of fast and efficient DNA sequencing technologies, the 
number of plant genomes recorded on computer databases is growing rapidly. 
These computer databases can be used to search for homologs to CBF 
sequences identified in this application as well as other sequences which 
encode binding proteins which regulate cold tolerance genes. As more and 
more binding protein sequences are identified and the number of computerized 
plant genome databases increase, searching computer databases for 
additional sequences encoding binding proteins which regulate cold tolerance 
genes will become increasingly simplified. 

8. Preparation Of Binding Proteins Derivatives Using 
Sequences Identified in This Application. 

According to the present invention, the binding protein is a protein 
which is capable of binding to a DNA regulatory sequence which regulates 
expression of one or more environmental stress tolerance genes in a plant. 
These DNA regulatory sequences are preferably a member of the CCG family 
of regulatory sequences and more preferably a member of the CCGAC family 
of regulatory sequences. 

Numerous amino acid sequences for CBF binding protein homologs are 
disclosed in this application including those shown in Figures 28, 14, and 188 
and listed in SEQ. I.D. Nos. 2, 13, 15, 39, 41. 43, 45, 47, 49. 51, 53, 55, 57, 59, 
61. 63, 65. 67, 69, 71, 73, 75, 77. 79. 81, 83, 85. 87, 89, 91. 93, and 95. 
Nucleic acid sequences encoding these C8F binding protein homologs are 
disclosed in this application in Figures 28. 12, 13, and 18A and listed in SEQ. 
I.D. Nos. 1, 12, 14, 40, 42, 44, 46, 48, 50, 52. 54, 56. 58, 60, 62, 64, 66, 68, 70, 
72, 74, 76, 78. 80, 82, 84, 86, 88, 90. 92. and 94. These sequences were 
derived from a variety of different plant species including Arabidopsis, Brassica 
Juncea, Brassica napus, Brassica oleracea, Brassica rapa, Glycine max, 
Raphanus sativus and Zea Maize. 

The sequences identified in these figures may generally be divided into 
three regions: AP2 domain, amino terminus domain, and carboxy terminus 
domain. Figured 19A-19E show different AP2 domains from these homologs 
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and consensus sequences between the different AP2 domains shown. 

Figure 19A shows an amino acid alignment of the AP2 domains of 
several CBF proteins with the consensus sequence between the proteins 
highlighted as well as a comparison of the AP2 domains with that of the 
tobacco DNA binding protein EREBp2. Figure 19B shows an amino acid 
alignment of the AP2 domains of several CBF proteins including dreb2a and 
dreb2b with the consensus sequence between the proteins highlighted. Figure 
19C shows an amino acid alignment of the AP2 domains of several CBF 
proteins including dreb2a, dreb2b, and tiny with the consensus sequence 
between the proteins highlighted. Figure 19D shows a consensus sequence 
corresponding to the difference between the consensus sequence shown in 
Figures 19A and tiny. Figure 19E shows a consensus sequence corresponding 
to the difference between the consensus sequence shown in Figures 19B and 
tiny. 

Figures 21 A and 21 B show different carboxy terminus domains from 
these homologs and consensus sequences between the different carboxy 
terminus domains shown. 

The binding proteins utilized in the present invention include classes of 
binding proteins which satisfy one or more of the following requirements: 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence sufficiently homologous to any one of the consensus 
sequences shown in Figures 19A, 19B, or 19C that the binding protein is 
capable of binding to a CCG regulatory sequence, preferably a CCGAC 
regulatory sequence; 

the binding protein comprises an AP2 domain which comprises a 
consensus sequence shown in Figures 19A, 19B or 19C; 

the binding protein comprises an AP2 domain which comprises the 
amino acid residues shown in Figures 19D or 19E; 

the binding protein comprises an AP2 domain which is sufficiently 
homologous to at least one of the AP2 domains shown in the application such 
that it is capable of binding to a CCG regulatory sequence, preferably a 
CCGAC regulatory sequence; 

the binding protein comprises one of the AP2 domain sequences shown 
in this application, including, but not limited to SEQ. I.D. Nos. 2, 13. 15, 39, 41. 
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43. 45, 47. 49. 51. 53. 55, 57, 59, 61, 63. 65. 67. 69, 71, 73. 75. 77. 79. 81. 83. 
85. 87. 89, 91. 93, and 95; 

the binding protein comprises a sequence which comprises one of the 
amino terminus domains shown in Figure 20 (it is noted that the sequence 
need not be at the amino terminus of the binding protein); 

the binding protein comprises the consensus sequence for the amino 
terminus domains shown in Figure 20, (it is noted that the sequence need not 
be at the amino terminus of the binding protein); 

the binding protein comprises a sequence which comprises one of the 
carboxy terminus domains shown in Figure 21A (it is noted that the sequence 
need not be at the carboxy terminus of the binding protein); 

the binding protein comprises the consensus sequence for the carboxy 
terminus domains shown in Figure 21 A (it is noted that the sequence need not 
be at the carboxy terminus of the binding protein); and 

the binding protein comprises the consensus sequence for the carboxy 
temninus domains shown in Figure 21B (it is noted that the sequence need not 
be at the carboxy terminus of the binding protein). 

The sequence of the binding protein may be a naturally occuning 
sequence such as the ones shown in SEQ. ID. Nos. 2, 13, 15, 39, 41 , 43. 45, 
47. 49. 51. 53, 55. 57, 59. 61. 63. 65. 67, 69, 71. 73, 75. 77. 79, 81. 83. 85, 87, 
89. 91. 93. and 95 or may be a non-naturaliy occumng sequence. It is noted, 
however, that binding proteins according to the present invention are intended 
to encompass non-naturaliy occurring sequences which are derivatives of the 
classes of binding proteins taught herein. 

Additional binding proteins may be constructed using one of the AP2 
domains taught herein or the consensus sequence of these AP2 domains. It 
may be desirable to include with the AP2 domain a transcription activation 
region. The transcription activation region may be native to the plant or non- 
native to the plant in which the binding protein will be used. For example, the 
sequence may include a subsequence which encodes a binding domain for the 
DNA regulatory sequence fused to a transcription activating region, such as the 
transcription activating region of VP16 or GAL4. Optionally, one can include in 
the binding protein one of the amino terminus domains, the consensus 
sequence for the amino temiinus domain, one of the carboxy terminus domains 
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and/or the consensus sequence for the carboxy terminus domains. It Is noted 
that the amino terminus domain may be positioned away from the amino 
temiinus of the new binding protein and the carboxy terminus domain may be 
positioned away from the carboxy temiinus of the new binding protein. 

Optionally, the binding protein can be viewed as comprising one of the 
amino terminus domains, the consensus sequence for the amino temiinus 
domain, one of the carboxy terminus domains and/or the consensus sequence 
for the carboxy temiinus domains. It is noted that the amino terminus domain 
may be positioned away from the amino terminus of the new binding protein 
and the carboxy terminus domain may be positioned away from the carboxy 
terminus of the new binding protein. 
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EXAMPLES 

^ ' I^olVf ^ Analysis of Arabidopsis Thaliana cDNA Clone 
(CBF1) Encoding C-repeat/DRE Binding Factor 

The following example describes the isolation of an Arabidopsis 
thaliana cDNA clone that encodes a C-repeat/DRE binding factor, CBF1 (C- 
repeat/DRE Binding Factor 1). Expression of CBF1 in yeast was found to 
activate transcription of reporter genes containing the C-repeat/DRE (CCGAC) 
as an upstream activator sequence. Meanwhile, CBF1 did not activate 
transcription of mutant versions of the CCGAC binding element, indicating that 
CBF1 is a transcription factor that binds to the C-repeat/DRE. Binding of CBF1 
to the C-repeat/DRE was also demonstrated in gel shift assays using 
recombinant CBF1 protein expressed in Escherichia coli. Analysis of the 
deduced CBF1 amino acid sequence indicated that the protein has a potential 
nuclear localization sequence, a possible acidic activation domain and an AP2 
domain, a DNA-binding motif of about 60 amino acids that is similar to those 
present in Arabidopsis proteins APETALA2. AINTEGUMENTA and TINY, the 
tobacco ethylene response element binding proteins, and numerous other plant 
proteins of unknown function. 

A. Materials 

Plant material and cold treatment. A thaliana (L.) Heyn. ecotype RLD 
plants were grown in pots in controlled environment chambers at 22 "C under 
constant illumination with cool-white fluorescent lamps ( 100 pmol m'^s-^) 
essentially as described (Gilmour, S. J., Plant Physiol. 87:745-750 (1988)). 
Plants were cold-treated by placing pots in a cold room at 2.5 °C under 
constant illumination with cool-white florescent lamps ( 25 nmol m'^s'^) for the 
indicated times. 

Arabidopsis cDNA expression library. The Arabidopsis pACT cDNA 
expression library was constructed by John Walker and colleagues 
(NSF/DOE/USDA Collaborative Research in Plant Biology Program grant 
USDA 92-37105-7675) and deposited in the Arabidopsis Biological Resource 
Center (stock #CD4-10). 
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Yeast reporter strains. Oligonucleotides (Table 1) (synthesized at the 
MSU Macromolecular Structure Facility) encoding either wild-type or mutant 
versions of the C-repeat/DRE were ligated into the Sg/ll site of the /acZ reporter 
vector pBgl-lacZ (Li, J. J. and I. Herskowitz, Science 262:1870-1874 (1993); 
kindly provided by Joachim Li). The resulting reported constructs were 
integrated into the ura3 locus of Saccharomyces cerevisiae strain GGY1 (MAT 
gal4 galdO ura3 Ieu2 his3 ade2 tyr) (Li, J. J. and I. Herskowitz, Science 
262:1870-1874 (1993); provided by Joachim Li) by transformation and 
selection for uracil prototrophy. 

E. coli strains. Escherichia co// strain GM2163 containing plasmid 
PEJS251 was deposited under the Budapest Treaty on May 17, 1996 with the 
American Type Culture Collection, Rockville, Maryland as ATCC 98063. It is 
available by name and number pursuant to the provisions of the Budapest 
Treaty. 



TABLE 1 

Oligonucleotides encoding wild type and mutant versions of the C-repeat/DRE 



Oligonucleotide 


C-repeat /DRE* 


Sequence 


SEQ 
ID NO: 


MT50 


COR15a 


GatcATTTCATGGCCGACCTGC 1 i 1 M 


3 


MT52 


mC0R15a 


CACAATTTCAaGaattcaCTGCTTTTTT 


4 


MT80 


mC0R15a 


GatcATTTCATGGtatqtCTGCTTTTT 


5 


MT125 


mCOR15a 


GatcATTTCATGGaatcaCTGCTTTTT 


6 


MT68 


COR15b 


GatcACTTGATGGCCGACCTC MINI 


7 


MT66 


COR78-1 


GatcAATATACTACCGACATGAGTTCT 


8 


MT86 


COR7B-2 


ACTACCGACATGAGTTCCAAAAAGC 


9 



*The C-repeat/DRE sequences tested are either wild-type found in the 
promoters of COR15a (Baker, S. S., et al., Plant, mol. Biol. 24:701-713 (1994)), 
COR15b or COR78/RD29A (Horvath, D. P., et al., Plant Physiol. 103:1047- 
1053 (1993); Yamaguchi-shinozaki, K., et al., The Plant Cell 6:251-264 (1994)) 
or are mutant versions of the COR15a C-repeat/DRE (mcOR15a, 
M2COR15a and M3COR15a). 
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#Uppercase letters designate bases in wild type C-repeat/DRE sequences. 
The core CCGAC sequence common to the above sequences is indicated in 
bold type. Lowercase letters at the beginning of a sequence indicate bases 
added to facilitate cloning. The lowercase letters that are underlined indicate 
the mutations in the C-repeat/DRE sequence of COR15a. 

B. Methods 

Screen of Arabidopsis cDNA library. The Arabidopsis pACT 
cDNA expression library was screened for clones encoding C-repeat/DRE 
environmental stress response regulatory elements by the following method. 
The cDNA library, harbored in Escherichia co//BNN132, was amplified by 
inoculating 0.5 ml of the provided glycerol stock into 1 L of M9 minimal glucose 
medium (Sambrook, J. et al, Molecular Cloning. A Laboratory Manual (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY., 2nd Ed. (1989)) and 
shaking the bacteria for 20 h at 37 °C. Plasmid DNA was isolated and purified 
by cesium chloride density gradient centrifugation (Sambrook et al (1989)) and 
transformed into the yeast GGY1 reporter strains selecting for leucine 
prototrophy. Yeast transformants that had been grown for 2 or 3 days at 30 °C 
were overlaid with either a nitrocellulose membrane filter (Schleicher and 
Schuell, Keene, NH) or Whatman #50 filter paper (Hillsboro, OR) and 
incubated overnight at 30 °C. The yeast impregnated filters were then lifted 
from the plate and treated with X-gal (5-bromo-4-chloro-3-indolyl -D- 
galactosidase) to assay colonies for beta-galactosidase activity (Li, J. J. and I. 
Herskowitz. Science 262:1870-1874 (1993)). Plasmid DNA from "positive" 
transformants (those forming blue colonies on the X-gal-treated filters) was 
recovered (Strathern. J. N., and D. R. Higgens, Methods Enzymol. 194:319- 
329 (1991)). propagated in E. coll DH5 and transformed back into the yeast 
reporter strains to confirm activity. 

Yeast transformation and quantitative beta-gaiactosidase 
assays. Yeast were transformed by either electroporation (Becker. D. M., et 
al., Methods Enzymol. 194:182-187 (1991)) or the lithium acetate/carrier DNA 
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method (SchiestI, R. H.. et al., Current Genetics 16:339-346 (1989)). 
Quantitative in vitro beta-gaiactosidase assays were done as described (Rose, 
M., et al., l\/lethods Enzymol. 101:167-180 (1983)). 

Expression of CBF1 protein in E. co//and yeast. CBF1 was 
expressed in £ coli using the pET-28a(+) vector (Novagen, iVIadison, Wl). The 
Bgl\\.Bcl\ restriction fragment of pACT-1 1 encoding CBF1 was ligated into the 
BamH\ site of the vector bringing CBF1 under control of the T7 phage 
promoter. The construct resulted in a "histidine tag." a thrombin recognition 
sequence and a "T7 epitope tag" being fused to the amino terminus of CBF1 . 
The construct was transformed into E. coli BL21 (DE3) and the recombinant 
CBF1 protein was expressed as recommended by the supplier (Novagen). 
Expression of CBF1 in yeast was accomplished by ligating restriction 
fragments encoding CBF1 (the BcH-BglW and Sg/ll-8g/ll fragments from pACT- 
11) into the BglW site of pDB20.1 (Berger, S. L., et al., Cell 70:251-265 (1992); 
kindly provided by Steve Triezenberg) bringing CBF1 under control of the 
constitutive ADC1 (alcohol dehydrogenase constitutive 1) promoter. 

Gel shift assays. The presence of expressed protein which 
binds to a C-repeat/DRE binding domain was evaluated using the following gel 
shift assay. Total soluble E. coli protein (40 ng) was incubated at room 
temperature in 10 pi of IX binding buffer [15 mM HEPES (pH 7.9). 1 mM 
EDTA, 30mM KCI, 5% glycerol, 5% BSA, ImM DTT) plus 50 ng poly(dl- 
dC):poly(dl-dC) (Pharmacia, Piscataway. NJ) with or without 100 ng competitor 
DNA. After 10 min, probe DNA (1 ng) that was ^^P-labeled by end-filling 
(Sambrook et al, 1989) was added and the mixture incubated for an additional 
10 min. Samples were loaded onto polyacrylamide gels (4% w/v) and 
fractionated by electrophoresis at 150V for 2h (Sambrook et al). Probes and 
competitor DMAs were prepared from oligonucleotide inserts ligated into the 
SamHI site of pUC118 (Vieira, J., et al.. Methods Enzymol. 153:3-11 (1987)). 
Orientation and concatenation number of the inserts were determined by 
dideoxy DNA sequence analysis (Sambrook, et al. (1989)). Inserts were 
recovered after restriction digestion with EcoRI and H/ndlll and fractionation on 
polyacrylamide gels (12% w/v) (Sambrook et al, 1989). 
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10 



15 



Northern and southern analysis. Northern and southern 
analysis was performed as follows. Total RNA was isolated from Arabidopsis 
(Gilmour, S. J., et al.. Plant Physiol. 87:745-750 (1988)) and the poly(A)* 
fraction purified using oligo dT cellulose (Sambrook, et al (1989)). Northem 
transfers were prepared and hybridized as described (Hajeia. R. K., et al., Plant 
Physiol. 93:1246-1252 (1990)) except that high stringency wash conditions 
were at 50 C in 0.1X SSPE [X SSPE is 3.6 M NaCI, 20 mM EDTA, 0.2 M Naj- 
HPO4 (pH7.7)], 0.5% SDS. Membranes were stripped in 0.1 X SSPE, 0.5% 
SDS at 95 °C for 15 min prior to re-probing. Total Arabidopsis genomic DNA 
was isolated (Stockinger, E. J., et al., J. Heredity, 87:214-218 (1996)) and 
southern transfers prepared (Sambrook et al 1989) using nylon membranes 
(MSI, Westborough. MA). High stringency hybridization and wash conditions 
were as described by Walling et al (Walling, L L, et al.. Nucleic Acids Res. 
16:10477-10492 (1988)). Low stringency hybridization was in 6X SSPE. 0.5% 
SDS, 0.25% low fat dried milk at 60 °C. Low stringency washes were in IX 
SSPE. 0.5% SDS at 50 °C. Probes used for the entire CBF1 coding sequence 
and 3' end of CBF1 were the Sc/l/Sg/ll and EcoRV/BglW restriction fragments 
from pACT-1 1, respectively, that had been gel purified (Sambrook et al (1989)). 
DNA probes were radiolabeled with '^P-nucleotides by random priming 
(Sambrook). Autoradiography was performed using hyperfilm-MP (Amersham. 
Ariington Heights, IL). Radioactivity was quantified using a Betascope 603 blot 
analyzer (Betagen Corp., Waltham, MA). 

^- Screen of Arabidopsis cDNA library for sequence 
encoding a C-repeat/DRE binding domain. 

The "one-hybrid" strategy (Li. J. J. and I. Herskowitz, Science 
262:1870-1874 (1993)) was used to screen for Arabidopsis cDNA clones 
encoding a C-repeat/DRE binding domain. In brief, yeast strains were 
constructed that contained a lacZ reporter gene with either wild-type or mutant 
C-repeat/DRE sequences in place of the normal UAS (upstream activator 
sequence) of the GAL1 promoter. 

Figures 1 A and 1 B show how the yeast reporter strains were 
constructed. Figure 1A is a schematic diagram showing the screening 
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Strategy. Yeast reporter strains were constructed that carried C-repeat/DRE 
sequences as UAS elements fused upstream of a lacZ reporter gene with a 
minimal GAL1 promoter. The strains were transformed with an Arabidopsis 
expression library that contained random cDNA inserts fused to the GAL4 
activation domain (GAL4-ACT) and screened for blue colony formation on X- 
gal-treated filters. Figure IB is a chart showing activity of the "positive" cDNA 
clones in yeast reporter strains. The oligonucleotides (oligos) used to make the 
UAS elements, and their number and direction of insertion, are indicated by the 
arrows. 

Yeast strains carrying these reporter constructs produced low levels of 
beta-galactosidase and formed white colonies on filters containing X-gal. The 
reporter strains carrying the wild-type C-repeat/DRE sequences were 
transformed with a DNA expression library that contained random Arabidopsis 
cDNA inserts fused to the acidic activator domain of the yeast GAL4 
transcription factor, "GAL4-ACr' (Figure 1A). The notion was that some of the 
clones might contain a cDNA insert encoding a C-repeat/DRE binding domain 
fused to GLA4-ACT and that such a hybrid protein could potentially bind 
upstream of the lacZ reporter genes carrying the wild type C-repeat/DRE 
sequence, activate transcription of the /acZgene and result in yeast forming 
blue colonies on X-gal-treated filters. 

Upon screening about 2x10^ yeast transformants, three "positive" 
CDNA clones were isolated; i.e., clones that caused yeast strains carrying la.-Z 
reporters fused to wild-type C-repeat/DRE inserts to form blue colonies on X- 
gal-treated filters (Figure 1 B). The three cDNA clones did not cause a yeast 
strain carrying a mutant C-repeat/DRE fused to LacZ to turn blue (Figure 1 B). 
Thus, activation of the reporter genes by the cDNA clones appeared to be 
dependent on the C-repeat/DRE sequence. Restriction enzyme analysis and 
DNA sequencing indicated that the three cDNA clones had an identical 1.8 kb 
insert (Figure 2A). One of the clones, designated pACT-11, was chosen for 
further study. 
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D. Identification of 24 kDa polyp ptide with an AP2 
domain encoded by pACT-1 1 . 

Figures 2A, 2B, 2C and 2D provide an analysis of the pACT-11 cDNA 
clone. Figure 2A is a schematic drawing of the pACT-1 1 cDNA insert indicating 
the location and 5' to 3' orientation of the 24 kDa polypeptide and 25s rRNA 
sequences. The cDNA insert was cloned into the Xho\ site of the pACT vector. 
Figure 2B is a DNA and amino acid sequence of the 24 kDa polypeptide (SEQ 
ID N0:1 and SEQ ID NO:2). The AP2 domain is indicated by a double 
underline. The basic amino acids that potentially act as a nuclear localization 
signal are indicated with asterisks. The Bell site immediately upstream of the 
24 kDa polypeptide used in subcloning the 24 kDa polypeptide and the EcoRV 
site used in subcloning the 3' end of CBF1 are indicated by single underlines. 
Figure 2C is a schematic drawing indicating the relative positions of the 
potential nuclear localization signal (NLS), the AP2 domain and the acidic 
region of the 24 kDa polypeptide. Numbers indicate amino acid residues. 
Figure 2D is a chart showing comparison of the AP2 domain of the 24 kDa 
polypeptide with that of the tobacco DNA binding protein EREBP2 (Okme- 
Takagi. M., et al.. The Plant Cell 7:173-182 (1995) SEQ ID NOS: 10 and 1 1). 
Identical amino acids are indicated with single lines; similar amino acids are 
indicated by double dots; amino acids that are invariant in AP2 domains are 
indicated with asterisks (Klucher. K. M.. et a!., The Plant Cell 8:137-153 
(1996)); and the histidine residues present in CBFI and TINY (Wilson. K., et 
al., The Plant Cell 8:659-671 (1996)) that are tyrosine residues in all other 
described AP2 domains are indicated with a caret. A single amino acid gap in 
the CBFI sequence is indicated by a single dot. 

Our expectation was that the cDNA insert in pACT-1 1 would have a C- 
repeat/DRE binding domain fused to the yeast GAL4-ACT sequence. 
However, DNA sequence analysis indicated that an open reading frame of only 
nine amino acids had been added to the C-terminus of GAL4-ACT. It seemed 
highly unlikely that such a short amino acid sequence could comprise a DNA 
binding domain. Also surprising was the fact that about half of the cDNA insert 
in pACT-1 1 corresponded to 25s rRNA sequences (Figure 2A). Further 
analysis, however, indicated that the insert had an open reading frame, in 
opposite orientation to the GAL4-ACT sequence, deduced to encode a 24 kDa 
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polypeptide (Figure 2A-C). The polypeptide has a basic region that could 
potentially serve as a nuclear localization signal (Raikhel, N., Plant Physiol. 
100:1627-1632 (1992)) and an acidic C-terminal half (pi of 3.6) that could 
potentially act as an acidic transcription activator domain (Hahn. S., Cell 
72:481-483 (1993)). A search of the nucleic acid and protein sequence 
databases indicated that there was no previously described homology of the 24 
kDa polypeptide. However, the polypeptide did have an AP2 domain (Jofuku, 
K. D., et al., The Plant Cell 6:1211-1225 (1994)) (Figures 2B, D), a DNA 
binding motif of about 60 amino acids (Ohme-Takagi. M., et a!.. The Plant Cell 
7:173-182 (1994)) that is present in numerous plant proteins including the 
APETALA2 (Jofuku, K. D., et al., The Plant Cell 6:121 1-1225 (1994)), 
AINTEGUMENTA (Klucher. K. M.. et al.. The Plant Cell 8:137-153 (1996); 
Elliot, R. C, et al., The Plant Cell 8:155-168 (1996)) and TINY (Wilson, K., et 
al., The Plant Cell 8:659-671 (1996)) proteins of Arabidopsis and the EREBPs 
(ethylene response element binding proteins) of tobacco (Ohme-Takagi, M., et 
al., The Plant Cell 7:173-182 (1995)). 

E. 24 kDa polypeptide binds to the C-repeat/DRE 
and activates transcription in yeast. 

We hypothesized that the 24 kDa polypeptide was responsible for 
activating the lacZ reporter genes in yeast. To test this, the Bcl\-Bgl\\ fragment 
of pACT-1 1 containing the 24 kDa polypeptide, and the Sg/ll-Sg/ll fragment 
containing the 24 kDa polypeptide plus a small portion of the 25s rRNA 
sequence, was inserted into the yeast expression vector pDB20.1 

Figure 3 is a chart showing activation of reporter genes by the 24 kDa 
polypeptide. Restriction fragments of pACT-1 1 carrying the 24 kDa polypeptide 
(Bcl\-Bgl\l) or the 24 kDa polypeptide plus a small amount of 25s RNA 
sequence (eg/ll-8g/ll) were inserted in both orientations into the yeast 
expression vector pDB20.1 (see Figure 2A and 2B for location of Bcl\ and BglU 
restriction sites). These "expression constructs" were transformed into yeast 
strains carrying the lacZ reporter gene fused to direct repeat dimers of either 
the wild-type COR15a C-repeat/DRE (oligonucleotide MT50) or the mutant 
M2COR15a C-repeat/DRE (oligonucleotide MT80). The specific activity of 
beta-galactosidase (nmoles o-nitrophenol produced/min ' x mg protein ■^) was 
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determined from cultures grown in triplicate. Standard deviations are indicated. 
Abbreviations: pADCI, ADC^ promoter tADCl, ADC1 terminator. 

Piasmids containing either insert in the same orientation as the ADC1 
promoter stimulated synthesis of beta-galactosidase when transformed into 
yeast strains can7ing the lacZ reporter gene fused to a wild-type C0R15a C- 
repeat/DRE (Figure 3). The piasmids did not. however, stimulate synthesis of 
beta-galactosidase when transformed into yeast strains carrying lacZ fused to a 
mutant version of the COR15a C-repeat/DRE (Figure 3). These data indicated 
that the 24 KDa polypeptide could bind to the wild-type C-repeat/DRE and 
activate expression for the lacZ reporter gene in yeast. Additional experiments 
indicated that the 24 kDa polypeptide could activate expression of the lacZ 
reporter gene fused to either a wild-type COR78 C-repeat/DRE (dimer of 
MT66) or a wild-type COR15b C-repeat/DRE (dimer of MT 68) (not shown). A 
plasmid containing the Bcl\.Bgl\\ fragment (which encodes only the 24 KDa 
polypeptide) cloned in opposite orientation to the ADC1 promoter did not 
stimulate synthesis of beta-galactosidase in reporter strains carrying the wild- 
type C0R15a C-repeat/DRE fused to lacZ (Figure 3). In contrast, a plasmid 
carrying the 5g/II-Sg/li fragment (containing the 24 kDa polypeptide plus some 
25s rRNA sequences) cloned in opposite orientation to the ADC1 promoter 
produced significant levels of beta-galactosidase in reporter strains carrying 
the wild-type C0R15a C-repeat/DRE (Figure 3). Thus, a sequence located 
closely upstream of the 24 kDa polypeptide was able to serve as a cryptic 
promoter in yeast, a result that offered an explanation for how the 24 kDa 
polypeptide was expressed in the original pACT-11 clone. 

F. Gel shift analysis indicates that the 

24 kDa polypeptide binds to the C-repeat/DRE. 

Gel shift experiments were conducted to demonstrate further that the 24 
kDa polypeptide bound to the C-repeat/DRE. Specifically, the open reading 
frame for the 24 kDa polypeptide was inserted into the pET-28a(+) bacterial 
expression vector (see Materials and Methods) and the resulting 28 kDa fusion 
protein was expressed at high levels in E. coli. (Figure 4). 

Figure 4 is a photograph of an electrophoresis gel showing expression 
of the recombinant 24 kDa polypeptide in E. coli. Shown are the results of 
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SDS-PAGE analysis of protein extracts prepared from £ co// harboring either 
the expression vector alone (vector) or the vector plus an insert encoding the 
24 KDa polypeptide in sense (sense insert) or antisense (antisense insert) 
orientation. The 28 kDa fusion protein (see Materials and Methods) is 
indicated by an arrow. 

Figure 5 is a photograph of a gel for shift assays indicating that CBF1 
binds to the C-repeat/DRE. The C-repeat/DRE probe (1 ng) used in all 
reactions was a "P-labeled dimer of the oligonucleotide MT50 (wild type C- 
repeat/DRE from COR15a). The protein extracts used in the first four lanes 
were either bovine serum albumin (8SA) or the indicated CBF1 sense, 
antisense and vector extracts described in Figure 4. The eight lanes on the 
right side of the figure used the CBF1 sense protein extract plus the indicated 
competitor C-repeat/DRE sequences (100 ng). The numbers 1X. 2X and 3X 
indicate whether the oligonucleotides were monomers, dimers or trimers, 
respectively, of the indicated C-repeat/DRE sequences. 

Protein extracts prepared from E. co// expressing the recombinant 
protein produced a gel shift when a wild-type COR15a C-repeat/DRE was used 
as probe (Figure 5). No shift was detected with BSA or E. co// extracts 
prepared from strains harboring the vector alone, or the vector with an 
antisense insert for the 24 kDa polypeptide. Oligonucleotides encoding wild- 
type C-repeat/DRE sequences from COR15a or COR78 competed effectively 
for binding to the C0R15a C-repeat/DRE probe, but mutant version of the 
COR15a C-repeat/DRE did not (Figure 5). These in vitro results corroborated 
the ,n VIVO yeast expression studies indicating that the 24 kDa polypeptide 
binds to the C-repeat/DRE sequence. The 24 kDa polypeptide was thus 
designated CBF1 (C-repeat/DRE binding factor 1) and the gene encoding it 
named CBF1. 



CBF1 is a unique or low copy number 



gene. 



Figure 6 is a photograph of a southern blot analysis indicating CBF1 is 
a unique or low copy number gene. Arabidopsis DNA ( 1 pg) was digested with 
the indicated restriction endonucleases and southern transfers were prepared 
and hybridized with a -P-labeled probe encoding the entire CBF1 polypeptide 
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The hybridization patterns observed in southern analysis of Arabidopsis 
DNA using the entire CBF1 gene as probe were relatively simple indicating that 
CBF1 is either a unique or low copy number gene (Figure 6). The hybridization 
patterns obtained were not altered if only the 3' end of the gene was used as 
the probe (the EcoRV/Bgl\\ restriction fragment from pACT-1 1 encoding the 
acidic region of CBF1 . but not the AP2 domain) or if hybridization was carried 
out at low stringency (not shown). 

H. CBF1 transcript level response to low temperature. 

Figures 7A, 7B and 7C relate to CBF1 transcripts in control and cold- 
treated Arabidopsis. Figure 7A is a photograph of a membrane RNA isolated 
from Arabidopsis plants that were grown at 22 °C or grown at 22 °C and 
transferred to 2.5 °C for the indicated times. Figures 7B and 7C are graphs 
showing relative transcript levels of CBF1 and COR15a in control and cold- 
treated plants. The radioactivity present in the samples described in Figure 7A 
were quantified using a Betascope 603 blot analyzer and plotted as relative 
transcript levels (the values for the 22 °C grown plants being arbitrarily set as 
1) after adjusting for differences in loading using the values obtained with the 
pHH25 probe. 

Based on Figures 7A-7C. northern analysis indicated that the level of 
CBF1 transcripts increased about 2 to 3 fold in response to low temperature 
(Figure 7B). In contrast, the transcript levels for COR15a increased 
approximately 35 fold in cold-treated plants (Figure 7C). Only a singly 
hybridizing band was observed for CBF1 at either high or low stringency with 
probes for either the entire CBF1 coding sequence or the 3' end of the gene 
(the EcoRV/fig/ll fragment of pACT-1 1) (not shown). The size of the CBF1 
transcripts was about 1.0 kb. 

I. Discussion Of Experimental Results. 

The above example regarding CBF1 represents the first identification of 
a gene sequence which encodes a protein capable of binding to the C- 
repeat/DRE sequence CCGAC. The experimental results presented evidence 
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that CBF1 binds to the C-repeat/DRE both in vitro via gel shift assays and in 
v,vo via yeast expression assays. Further, the results demonstrate that CBF1 
can activate transcription of reporter genes in yeast that contain the C- 
repeat/DRE. 

The results of the southern analysis indicate that CBF1 is a unique or 
low copy number gene in Arabidopsis. However, the CBF1 protein contains a 
60 amino acid motif, the AP2 domain, that is evolutionary conserved in plants 
(Weigel, D.. The plant Cell 7:388-389 (1995)). It is present in the APETALA2 
(Jofuku. K. D.. et al.. The Plant Cell 6:121 1-1225 (1994)). AINTEGUMENTA 
(Klucher. K. M., et al.. the Plant Cell 8:137-153 (1996; and Elliot. R C et al 
The Plant Cell 8:155-168 (1996)). TINY (Wilson. K.. et al.. The Plant Cell 
8:659-671 (1996)) and cadmium-induced (Choi, S.-Y.. et al.. Plant Physiol 
108:849 (1995)) proteins of Arabidopsis and the EREBPs of tobacco (Ohme- 
Takagi. M. et al.. The Plant Cell 7:173-182 (1995)). In addition, a search of the 
GenBank expressed sequence tagged cDNA database indicates that there is 
one CDNA from B. napus, two from Ricinus communis, and more than 25 from 
Arabidopsis and 15 from rice, that are deduced to encode proteins with AP2 
domains. The results of Ohme-Takagi and Shinshi (Ohme-Takagi M et al 
The Plant Cell 7:173-182 (1995)) indicate that the function of the AP2 domain 
.s DNA-binding; this region of the putative tobacco transcription factor EREBP2 
is responsible for its binding to the c/s-acting ethylene response element 
referred to as the GCC-repeat. As discussed by Ohme-Takagi and Shinshi 
(Ohme-Takagi. M.. et al.. the Plant Cell 7:173-182 (1995)). the DNA-binding 
domain of EREBP2 (the AP2 domain) contains no significant amino acid 
sequence similarities or obv.ous structural similarities with other known 
transcnption factors or DNA binding motifs. Thus, the domain appears to be a 
novel DNA-binding motif that to date, has only been found in plant proteins 

It is believed that the binding of CBF1 to the C-repeat/DRE involves the 
AP2 domain. In this regard, it is germane to note that the tobacco ethylene 
response element. AGCCGCC, closely resembles the C-repeat/DRE 
sequences present in the promoters of the Arabidopsis genes COR15a 
GGCCGAC, and COR78/RD29A. TACCGAC. Applicants believe that CBF1 
the EREBPs and other AP2 domain proteins are members of a superfamily of 
DNA binding proteins that recognize a family of c/s-acting regulatory elements 
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having CCG as a common core sequence. Differences in the sequence 
surrounding the CCG core element could result in recruitment of different AP2 
domain proteins which, in turn, could be integrated into signal transduction 
pathways activated by different environmental, hormonal and developmental 
cues. Such a scenario is akin to the situation that exists for the ACGT-family of 
c/s-acting elements (Foster et ai., FASEB J. 8:192-200 (1994)). In this case 
differences in the sequence surrounding the ACGT core element result in the 
recruitment of different bZIP transchption factors involved in activating 
transcription in response to a variety of environmental and developmental 
signals. 

The results of the yeast transfomiation experiments indicate that CBF1 
has a domain that can serve as a transcriptional activator. The most likely 
candidate for this domain is the acidic C-temiinal half of the polypeptide 
Indeed, random acidic amino acid peptides from E. co//have been shown to 
substitute for the GAL4 acidic activator domain of GAL4 in yeast (Ma, J. and M 
Ptashne, Cell 51:113-199 (1987)). Moreover, acidic activator domains have 
been found to function across kingdoms (Hahn, S.. Cell 72:481-483 (1993))- 
the yeast GAL4 acidic activator, for instance, can activate transcription in 
tobacco (Ma, J., et al.. Nature 334:631-633 (1988)). It has also been shown 

that certain plant transcription factors, such as Vpl (Mccarty D R etal Cell 
66:895-905 (1991)). have acidic domains that function as transcriptional " 
activators in plants. Significantly, the acidic activation domains of the yeast 
transcription factors VP16 and GCN4 require the "adaptor proteins ADA2 
ADA3, and GCN5 for full activity (see Guarente. L., Trends Biochem Sci ' 
20:517-521 (1995)). These proteins form a heteromeric complex (Horiuchi J 
et al., Mol. Cell Biol. 15:1203-1209 (1995)) that bind to the relevant activation" 
domains. The precise mechanism of transcriptional activation is not known but 
appears to involve histone acetylation: there is a wealth of evidence showing a 
positive correlation between histone acetylation and the transcriptional activity 
of Chromatin (Wolffe. A. P., Trends Biochem. Sci. 19:240-244 (1994)) and 
recently, the GCN5 protein has been shown to have histone acetyltransferase 
activity (Brownell, J. E.. et al., Cell 84:843-851 (1996)). Genetic studies 
mdicate that CBF1. like VP16 and GCN4. requires ADA2. ADA3 and GCN5 to 
function optimally in yeast. The fundamental question thus raised is whether 
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plants have homologs of ADA2. ADA3 and GCN5 and whether these adaptors 
are required for CBF1 function (and function of other transcription factors with 
acidic activator regions) in Arabidopsis. 

A final point regards regulation of C8F1 activity. The results of the 
northern analysis indicate that CBF1 transcript levels increase only slightly in 
response to low temperature, while those for C0Rl5a increase dramatically 
(Fig. 7). Thus, unlike in yeast, it would appear that transcription of CBF1 in 
Arabidopsis at wami temperatures is not sufficient to cause appreciable 
activation of promoters containing the C-repeat/DRE. The molecular basis for 
this apparent low temperature activation of CBF1 in Arabidopsis is not known 
One intriguing possibility, however is that CBF1 might be modified at low 
temperature in Arabidopsis resulting in either stabilization of the protein 
translocation of the protein from the cytoplasm to the nucleus, or activation of 
either the DNA binding domain or activation domain of the protein Such 
modification could involve a signal transduction pathway that is activated by 
low temperature. Indeed, as already discussed, cold-regulated expression of 
COR genes in Arabidopsis and alfalfa appears to involve a signal transduction 
pathway that is activated by low temperature-induced calcium flux (Knight H 
et ai., The Plant Cell 8:489-503 (1996); Knight. M. R.. et al.. Nature 352-524- ' 
526 (1991); Monroy, A. F., et al, Plant Physiol. 102:1227-1235 (1993)- Monroy 
A. F., and R. S., The Plant Cell. 7:321-331 (1995)). It will, therefore, be of 
interest to determine whether CBF1 is modified at low temperature, perhaps by 
phosphorylation, and if so, whether this is dependent on calcium-activated 
signal transduction. 



2. Use of CBF1 To Induce Cold Regulated Gene 

Expression in Nonacclimated Arabidopsis Plants. 

The following example demonstrates that increased expression of 
CBF1 induces COR gene expression in nonacclimated Arabidopsis plants 
Transgenic Arabidopsis plants that overexpress CBF1 were created by placing 
a cDNA encoding CBF1 under the control of the strong cauliflower mosaic virus 
(CaMV) 35S promoter and transforming the chimeric gene into Arabidopsis 
ecotype RLD plants (Standard procedures were used for plasmid 
manipulations (J. Sambrook, et al.. Molecular Cloning. A Laboratory Manual 
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(Cold Spring Harbor Laboratory Press. Cold Spring Harbor, ed. 2. (1989)) The 

CBFl-containlng^sel-eg/il fragment from pACT-Bgi + (Stockinger E J et al 
Proc. Natl. Acad. Sci. U.S.A. 94:1035 (1997)) was gel-purified, SamHI linkers ' 
were ligated to both ends and the fragment was inserted into the SamHI site in 
PCIB710 (S. Rothstein. et al., Gene 53:153-161 (1987)) which contains the 
CaMV 35S promoter and terminator. The chimeric plasmid was linearized at 
the Kpn\ site and inserted into the Kpnl site of the binary vector pCIB1 Og (Ciba- 
Geigy. Research Triangle Park, NC). The plasmid was transfomied into 
Agrobacterium tumefaciens strain C58C1 (pMPGO) by electroporation 
Arabidopsis plants were transfomied by the vacuum infiltration procedure (N 
Bechtold. J. Ellis, and G. Pelletier, C. R. Acad. Sci. Paris, Life Sci 3161 194- 
1199 (1993)) as modified (A. van Hoof, P. J. Green, Plant Journal 10-415-424 
(1996)). Initial screening gave rise to two transgenic lines, A6 and BIG that 
accumulated CBF1 transcripts at elevated levels. 

Figure 8 is a Northern blot showing CBF1 and COR transcript levels in 
RLD and transgenic Arabidopsis plants. Leaves from nonacclimated and 
three-day cold-acclimated plants {Arabidopsis thaliar^a ecotype RLD plants 
were grown in pots under continuous light ( 100 ME/m^/sec) at 22 C for 18-25 
days as described (Gilmour, S. J., et al.. Plant Physiol. 87:735 (1988)) In 
some cases, plants were then cold-acclimated by placing them at 2.5 "C under 
continuous light ( 50 ,E/m^/sec) for varying amounts of time. Leaves were 
harvested and total RNA prepared and analyzed for CBF1 and COR transcripts 
by RNA blot analysis using -P-radiolabeled probes (Total RNA was isolated 
from plant leaves and subjected to RNA blot analysis using high stringency 
hybnd.zation and wash conditions as described (E.J. Stockinger, et al Proc 
Natl. Acad. Sci. USA 94:1035 (1997); and S.J. Gilmour, et al.. Plant Physiol ' 
87:735 (1988)). 

Figure 9 is an immunoblot showing C0R15sm protein levels in RLD 
and transgenic Arabidopsis plants. Total soluble protein (100 pg) was 
prepared from leaves of the nonacclimated RLD (RLDw) 4-day cold- 
acclimated RLD (RLDC4), 7-day cold-acclimated RLD (RLDc7) and 
nonacclimated A6 and B16 plants and the levels of C0R15arn determined by 
.mmunoblot analysis using antiserum raised against the COR15^m polypeptide 
(Total soluble protein was isolated from plant leaves, fractionated by tricine 
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SDS-PAGE and transferred to 0.2 micron nitrocellulose as previously described 
(N. N. Artus et al.. Proc. Natl. Acad. Sci. U.S.A. 93:13404 (1996)). C0R15am 
prote,n was detected using antiserum raised to purified CORISam and protein 
A conjugated alkaline phosphatase (Sigma. St. Louis. MO) (N. N. Artus etal 
Proc. Natl. Acad. Sci. U.S.A. 93:13404 (1996)). No reacting bands were 
observed with preimmune serum (not shown). 

Southern analysis indicated that the A6 line had a single DNA insert 
while the B16 line had multiple inserts (not shown). Examination of fourth 
generation homozygous A6 and B16 plants indicated that CBF1 transcript 
levels were higher in nonacclimated A6 and 816 plants than they were in 
nonacclimated RLD plants, the levels in A6 being about three fold higher than 
in B16 (Figure 8). 

CBF1 overexpression resulted in strong induction of COR gene 
expression (Figure 8). Specifically, the transcript levels of C0R6 6 COR15a 
COR47 and COR78 were dramatically elevated in nonacclimated A6 and B16 
plants as compared to nonacclimated RLD plants. The effect was greater in 
the A6 line, where COR transcript levels in nonacclimated plants approximated 
those found in cold-acclimated RLD plants. The finding that COR gene 
expression was greater in A6 plants than in B16 plants was consistent with 
CBF1 transcript levels being higher in the A6 plants (Figure 7A). Immunoblot 
analysis indicated that the levels of the CORISam (Figure 9) and COR6 6 (not 
Shown) polypeptides were also elevated in the A6 and B16 lines, the level of 
expression again being higher in the A6 line. Attempts to identify the CBF1 
protein in either RLD or transgenic plants were unsuccessful. Overexpression 
Of CBF1 had no effect on the transcript levels for elF4A (eukaryotic initiation 
factor 4A) (Metz, A.M.. et al.. Gene 120:313 (1992)), a constitutively expressed 
gene that is not responsive to low temperature (Figure 8) and had no obvious 
effects on plant growth and development. 

^^^^^^"'t^fr^'" this example demonstrate that overexpression of the 
>*rai„dops/s transcriptional activator CBF1 induces expression of an 
Aratidopsis CO^ "regulon" composed of genes carrying the CRT/DRE DNA 
regulatory element. It appears that CBF1 binds to the CRT/DRE DNA 
regulatory elements present in the promoters of these genes and activates 
transcription which is consistent with the notion of CBF1 having a role in COR 
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gene regulation. Significantly, there was a strong correlation between CBF1 
transcript levels and the magnitude of COR gene induction in nonacclimated 
A6. B16. and RLD plants (Figure 8). However, upon low temperature treatment 
the level of CBF1 transcripts remained relatively low in RLD plants, while COR 
gene expression was induced to about the same level as that in nonacclimated 
A6 plants (Figure 8). Thus, it appears that CBF1 or an associated protein 
becomes "activated" in response to low temperature. 

3. CBF1 Overexpresslon Resulted in a Marked 
Increase in Plant Freezing Tolerance 

The following example describes a comparison of the freezing tolerance 
of nonacclimated Aratidopsis plants which overexpress CBF1 to that of cold- 
acclimated wild-type plants. As described below, the freezing tolerance of 
nonacclimated Arabidopsis plants overexpressing CBF1 significantly exceeded 
that of non-acclimated wild-type Aratidopsis plants and approached that of 
cold-acclimated wild-type plants. 

Freezing tolerance was determined using the electrolyte leakage test 
(Sukumaran. N. P., et al.. HortScience 7:467 (1972)). Detached leaves were 
frozen to various subzero temperatures and, after thawing, cellular damage 
(due to freeze-induced membrane lesions) was estimated by measuring ion 
leakage from the tissues. 

Figures 10A and 10B are graphs showing freezing tolerance of leaves 
from RLD and transgenic Arabidopsis plants. Leaves from nonacclimated RLD 
(RLDw) plants, cold-acclimated RLD (RLDc) plants and nonacclimated A6 B16 
and T8 plants were frozen at the indicated temperatures and the extent of 
cellular damage was estimated by measuring electrolyte leakage (Electrolyte 
leakage tests were conducted as described (N.P.Sukumaran et al 
HortScience 7. 467 (1972); and S.J. Gilmour. et al.. Plant Physiol 87 735 
(1988)) w.th the following modifications. Detached leaves (2-4) from 
nonacclimated or cold-acclimated plants were placed in a test tube and 
submerged for 1 hour in a -2 °C water-ethylene glycol bath in a completely 
randomized design, after which ice crystals were added to nucleate freezing 
After an additional hour of incubation at -2 "C. the samples were cooled in 
decrements of 1 "C each hour until -8 »C was reached. Samples (five 

70 



wo 99/38977 

PCT/US99/01895 

replicates for each data point) were thawed overnight on ice and incubated in 3 
ml distilled water with shaking at room temperature for 3 hours. Electrolyte 
leakage from leaves was measured with a conductivity meter. The solution 
was then removed, the leaves frozen at -80 "C (for at least one hour) and the 
solution returned to each tube and incubated for 3 hours to obtain a value for 
100% electrolyte leakage. In Figures 10A and 10B. the RLDc plants were cold- 
acclimated for 10 and 1 1 days, respectively. Error bars indicate standard 
deviations. 

As can be seen from Figure 10A and 10B, CBF1 overexpression 
resulted in a marked increase in plant freezing tolerance. The experiment 
presented in Figure 10A indicates that the leaves from both nonacclimated A6 
and B16 plants were more freezing tolerant than those from nonacclimated 
RLD plants. Indeed, the freezing tolerance of leaves from nonacclimated A6 
Plants approached that of leaves from cold-acclimated RLD plants. The results 
also indicate that the leaves from nonacclimated A6 plants were more freezing 
tolerant than those from nonacclimated B16 plants, a result that is consistent 
with the greater level of CBF1 and COR gene expression in the A6 line 
The results presented in Figure 10B further demonstrate that the 
freezing tolerance of leaves from nonacclimated A6 plants was greater than 
that of leaves from nonacclimated RLD plants and that it approached the 
freezing tolerance of leaves from cold-acclimated RLD plants. In addition the 
results indicate that overexpression of CBF1 increases freezing tolerance'to a 
much greater extent than overexpressing CORi5a alone. This conclusion 
comes from comparing the freezing tolerance of leaves from nonacclimated A6 
and T8 plants (Figure 10B). T8 plants (Artus. N. N.. et al., Proc. Natl Acad 
Sc.. U.S.A. 93:13404 (1996)) are from a transgenic line that constitutively 
expresses COR15a (under control of the CaMV 35S promoter) at about the 
same level as in A6 plants (Figure 1). However, unlike in A6 plants, other 
CRT/DRE-regulated COR genes are not constitutively expressed in T8 plants 
(Figure 8). 

A comparison of EUo values (the freezing temperature that results in 
release of 50o/o of tissue electrolytes) of leaves from RLD. A6. B16 and T8 
plants is presented in Table 2. 
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ELso values were calculated and compared by analysis of variance 
curves fitting up to third order linear polynomial trends were determined for 
each electrolyte leakage experiment. To insure unbiased predictions of 
electrolyte leakage, trends significantly improving the model fit at the 0 2 
probability level were retained. EL^o values were calculated from the fitted 
models, in Table 2. an unbalanced one-way analysis of variance, adjusted for 
the d,fferent numbers of EL. values for each plant type, was determined using 
SAS PROC GLM [SAS Institute, Inc. (1989). SAS/STAT User's Guide Version 
6, Cory, NC)]. EUo values ± SE (n) are presented on the diagonal line for 
leaves from nonacclimated RLD (RLDw). cold-acclimated (7 to 10 days) RLD 
(RLDc) and nonacclimated A6. B16 and T8 plants. P values for comparisons 
of ELso values are indicated in the intersecting cells. 

TABLE 2 







ELso values — 


RLDw 


RLDw 


RLDc 


. A6 


B16 


T8 


RLDc 


-3.9±0.21 
(8) 


P<0.0001 


H<aoooi 


P=0.0014 


P=0.7406 


A6 




-7.6 ±0.30 
(4) 


P=0.3261 


P<0.0001 


P<0.0001 


B16 






-7.2+0.25 
(6) 


P<0.0001 


P<0.0001 


T8 








-5.2±0.27 
(5) 


P=0.0044 


i 










-3.8±0.35 
(3) 





The data confimi that: 1) the freezing tolerance of leaves from both 
nonacclimated A6 and B16 plants is greater than that of leaves from both 
nonacclimated RLD and T8 plants; and 2) that leaves from nonacclimated A6 
Plants are more freezing tolerant than leaves from nonacclimated B16 plants 
No s,gn.ficant difference was detected in EL. values for leaves from 
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nonacclimated A6 and cold-acclimated RLD plants or from nonacclimated RLD 
and T8 plants. 

The enhancement of freezing tolerance in the A6 line was also apparent 
at the whole plant level. Figure 1 1 is a photograph showing freezing survival of 
RLD and A6 Arabidopsis plants. Nonacclimated (WARM) RLD and A6 plants 
and 5-day cold-acclimated (COLD) RLD plants were frozen at -5 °C for 2 days 
and then returned to a growth chamber at 22 °C (Pots (3.5 inch) containing 
about 40 nonacclimated Arabidopsis plants (20 day old) and 4 day cold- 
acclimated plants (25 days old) {Arabidopsis thaliana ecotype RLD plants were 
grown in pots under continuous light ( 100 pE/m^/sec) at 22 *C for 18-25 days 
as described (S.J. Gilmour, et al., Plant Physiol. 87:735 (1988)). In some 
cases, plants were then cold-acclimated by placing them at 2.5 °C under 
continuous light ( 50 pE/m^/sec) for varying amounts of time) were placed in a 
completely randomized design in a -5 °C cold chamber in the dark. After 1 
hour, ice chips were added to each pot to nucleate freezing. Plants were 
removed after 2 days and returned to a growth chamber at 22 °C.). A 
photograph of the plants after 7 days of regrowth is shown. 

Although the magnitude of the difference varied from experiment to 
experiment, nonacclimated A6 plants consistently displayed greater freezing 
tolerance in whole plant freeze tests than did nonacclimated RLD plants 
(Figure 11). No difference in whole plant freeze survival was detected between 
nonacclimated B16 and RLD plants or nonacclimated T8 and RLD plants (not 
shown). 

The results of this experiment show that CBF1 -induced expression of 
CRT/DRE-regulated COR genes result in a dramatic increase in freezing 
tolerance and confirms the belief that COR genes play a major role in plant 
cold acclimation. The increase in freezing tolerance brought about by 
expressing the battery of CRT/DRE-regulated COR genes was much greater 
than that brought about by overexpressing COR15a alone indicating that COR 
genes in addition to COR15a have roles in freezing tolerance. 

Traditional plant breeding approaches have met with limited success in 
improving the freezing tolerance of agronomic plants (Thomashow. M. F., Adv. 
Genet 28:99 (1990)). For instance, the freezing tolerance of the best wheat 
varieties today is essentially the same as the most freezing-tolerance varieties 
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developed in the early part of this century. Thus, in recent years there has 
been considerable int rest that biotechnology might offer new strategies to 
.mprove the freezing tolerance of agronomic plants. By the results of the 
present invention. Applicants demonstrate the ability to enhance the freezing 
tolerance of nonacclimated Arabidopsis plants by increasing the expressing of 
the Arab,dopsis regulatory gene CBF1. As described throughout this 
application, the ability of the present invention to modify the expression of 
environmental stress tolerance genes such as core genes has wide ranging 
implications since the CRT/DRE DNA regulatory element is not limited to 
Arab,dopsis (Jiang C. et al., Plant Moi. Biol. 30:679 (1996)). Rather CBF1 
and homologous genes can be used to manipulate expression of CRT/DRE- 
regulated COR genes in important crop species and thereby improve their 
freezing tolerance. By transforming modified versions of CBF1 (or homologs) 
.mo such Plants, it will extend their safe growing season, increase yield and 
expand areas of production. 

4. Selection Of Promoters To Control Expression of CBF1 In Plants 

The following examples describe the isolation of different promoters 
from plant genomic DNA, construction of the plasmid vectors carrying the 
CBF1 gene and the inducible promoters, transformation of Arabidoposis 
cells/plants with these constructs, and regeneration of transgenic plants with 
increased tolerance to environmental stresses. 

A. Isolation of inducible promoters from plant genomic DNAs 

inducible promoters from different plant genomic DNAs were identified 
and isolated by PCR amplification using primers designed to flank the promoter 
region and contain suitable restriction sites for cloning into the expression 
vector. The following genes were used to BLAST search Genbank to find the 
rnducible promoters: Dreb2a; P5CS; Rd22; Rd29a; Rd29b: Rab18; Cor47 
Table 3 l.sts the accession numbers and positions of these promoters. Table 4 
l.sts the fon^ard and reverse primers that were used to isolate the promoters 
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TABLE 3 



Gene Name 


Accession No. 


Position 


Length (bps) 


Dreb2a 


AB010692 


51901-53955 


2054 


P5CS 


AC003000 


45472-47460 


1988 


Rd22 


D 10703 


17-1046 


1029 


Rd29a 


D 13044 


3870-5511 


1641 


Rd29b 


D13044 


90-1785 


1695 


Rab18 


AB013389 


8070-9757 


1687 


Cor47 


AB004872 


1-1370 


1370 



TABLE 4 



Promoter name 


Primer name 


Cloning sites 


SEQ.fD.No. 


Dreb2a 


Dreb2a-reverse 


Hindlll (AAGCTT) 


19 




Dreb2a-forward 


Bglll (AGATCT) 


20 


P5CS 


P5CS-reverse 


Hindlll (AAGCTT) 


21 




P5CS-foward 


Bglll (AGATCT) 


22 


Rd22 


Rd22-reverse 


Hindlll (AAGCTT) 


23 




Rd22-forward 


Kpnl (GGTACC) 


24 


Rd29a 


Rd29a-reverse 


Hindlll (AAGCTT) 


25 




Rd29a-forward 


Kpnl (GGTACC) 


26 


Rd29b 


Rd29b-reverse 


Hindlll (AAGCTT) 


27 


Rab18 


Rd29b-forward 


Kpnl (GGTACC) 


28 


Rab18-reverse 


Hindlll (AAGCTT) 


29 




Rab18-forward 


Bglll (AGATCT) 


30 


Cor47 


Cor47-reverse 


Hindlll (AAGCTT) 


31 




Cor47-forward 


Bglll (AGATCT) 


32 



(1) Dreb2a promoter 

A cDNA encoding DRE (C-repeat) binding protein (DREB2A) has been 
recently identified (Liu, et al. 1998 Plant Cell 10:1391-1406). The transcription 
of the DREB2A gene is activated by dehydration and high-salt stress, but not 
by cold stress. The upstream untranslated region (166 bps) of dreb2a was 
used to BLAST-search the public database. A region containing the DREB2A 
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promoter was identified in chromosome 5 of Arabidopsis (Accession No. 
AB010692) between nucleotide positions 51901-53955 (Table 3). 

Two PCR primers designed to amplify the promoter region from 
Arabidopsis ttialiana genomic DNA are as follows: (:/red2a-reverse: 
5'-GCCCA4GC7TCAAGTTTAGTGAGCACTATGTGCTCG-3' [SEQ ID No. 19); 
and drei)2a-forward: 5'-GGA4Ga7:crCCTTCCCAGAAACAACACAATCTAC-3" 
[SEQ. ID. No. 20]. The c/re2f)a-reverse primer includes a Hind III (AAGCTT ) 
restriction site near the 5'-end of the primer and dre62a-fonward primer has a 
Bgl II ( AGATCT ) restriction site at near 5'-end of the primer. These restriction 
sites may be used to facilitate cloning of the fragment into an expression 
vector. 

Total genomic DNA may be isolated from Arabidopsis thaliana (ecotype 
Colombia) by using the CTAB method (Ausubel et al. (1992) Current Protocols 
in Molecular Biology (Greene & Wiley. New York)). Ten nanograms of the 
genomic DNA can be used as a template in a PCR reaction under conditions 
suggested by the manufacturer (Boehringer Mannheim). The reaction 
conditions that may be used in this PCR experiment are as follows: Segment 1: 
94°C, 2 minutes; Segment 2: 94°C, 30 seconds; 60°C, 1 minute; 72°C, 3 
minutes, for a total of 35 cycles; Segment 3: 72°C for 10 minutes. A PCR 
product of 2054 bp is expected. 

The PCR products can be subject to electrophoresis in a 0.8% agarose 
gel and visualized by ethidium bromide staining. The DNA fragments 
containing the inducible promoter will be excised and purified using a Qiaquick 
gel extraction kit (Qiagen, CA). 

(2) P5CS promoter 

A cDNA for delta 1 -pyrroline-5-carboxylate synthetase (P5CS) has been 
isolated and characterized (Yoshiba, et al., 1995, Plant J. 7:751-760). The 
cDNA encodes an enzyme involved in the biosynthesis of proline under 
osmotic stress (drought/high salinity). The transcription of the P5CS gene was 
found to be induced by dehydration, high salt and treatment with plant hormone 
ABA, while it did not respond to heat or cold treatment. 

A genomic DNA containing a promoter region of P5CS was identified by 
a BU\ST search of Genbank using the upstream untranslated region (106 bps) 
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of the P5CS sequence (Accession No. D32138). The sequence for the P5CS 
promoter is located in the region between from nucleotide positions 45472 to 
47460 (Accession No. AC003000; Table 3). 

Reverse and forward PGR primers designed to amplify this promoter 
region from Arabidopsis thaliana genomic DNA are P5CS-reverse primer 

5^GCCCMGC7TGTTTCATTTTCTCCATGAAGGAGAT-3' [SEQ. ID. No. 21]; 
and P5CS-forward primer 5'- 

GGA^G^TCTTATCGTCGTCGTCGTCTACCAAAACCACAC-3' [SEQ. ID. No. 
22], 

Total genomic DNA may be isolated from Arabidopsis thaliana (ecotype 
Colombia) by using the CTAB method (Ausubel et al. (1992) Cun-ent Protocols 
in Molecular Biology (Greene & Wiley, New York)). Ten nanograms of the 
genomic DNA can be used as a template in a PGR reaction under conditions 
suggested by the manufacturer (Boehringer Mannheim). The PGR product is 
expected to be 1988 bps and may be PGR amplified and gel purified following 
the same protocol described for the dreb2a promoter. 

(3) rd22 promoter 

A cDNA done of rd22 was isolated from Arabidopsis under dehydration 
conditions (Yamaguchi-Shinozaki and Shinozaki, Mol. Gen. Genet. 238:17-25 
(1993)). Transcripts of rd22 were found to be induced by salt stress, water 
deficit and endogenous abscisic acid (ABA) but not by cold or heat stress. A 
promoter region was identified from Genebank by using Nucleotide Search 
WWW Entrez at the NGBI with the rd22 as a search word. The sequence for 
the rd22 promoter is located in the region between nucleotide positions 17 to 
1046 (Accession No. D10703; Table 3), 

Reverse and fonward PGR primers designed to amplify this promoter 
region from Arabidopsis thaliana genomic DNA are rd22-reverse primer 5'- 

GGTGTMGC7TCAGAAGGGGTTCGTTTGGTGG-3' [SEQ. ID. No. 23]; and 
rd22-forward primer 5 - 

GGGGT;ACCTTTTGGGAGTTGGAATAGAAATGGGTTTGATG-3' [SEQ. ID. 
No. 24]. The rd22-reverse primer includes a Hind III (AAGGTT ) restriction site 
near the 5'-end of primer and rd22-forward primer has a Kpnl (GGJACC) 
restriction site at near 5'-end of primer. 
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Total genomic DNA may be isolated from Arabidopsis thaliana 
(ecotype Colombia) by using the CTAB method (Ausubel et al. (1992) Current 
Protocols in Molecular Biology (Greene & Wiley, New York)). Ten nanograms 
of the genomic DNA can be used as a template in a PCR reaction under 
conditions suggested by the manufacturer (Boehringer Mannheim). The PCR 
product is expected to be 1029 bps and may be PCR amplified and gel purified 
following the same protocol described for the drebZa promoter. 

(4) rd29a promoter 

The rd29a and rb29b genes were isolated and characterized by 
Shinozaki's group in Japan (Yamaguchi-Shinizaki and Shinozaki. Plant Physiol. 
101: 1 1 19-1 120 (1993)). Both rd29a and rb29b gene expressions were found 
to be induced by desiccation, salt stress and exogenous ABA treatment 
(Yamaguchi-Shinizaki and Shinozaki, Plant Physiol. 101: 1119-1120 (1993); 
Ishitanietal., Plant Cell 10: 1151-1161 (1998)). The rc/29a gene expression 
was induced within 20 min after desiccation, but rd29b mRNA did not 
accumulate to a detectable level until 3 hours after desiccation. Expression of 
rd29a could also be induced by cold stress, whereas expression of rd29b could 
not be induced by low temperature. 

A genomic clone carrying the rd29a promoter was identified by using 
Nucleotide Search WWW Entrez at the NCBI with the rd29a as a search word. 
The sequence for the rd29a promoter is located in the region between 
nucleotide positions 3870 to 551 1 (Accession No. D13044, Table 3). 

Reverse and fonward primers designed to amplify this promoter region 
from Arabidopsis genomic DNA are: rc/29a-reverse primer 

5'-GCCCMGC7TAATTTTACTCAAAATGTTTTGGTTGC-3' [SEQ. ID. No. 25]; 
and rcf29a-forward primer 5'- 

CCGGi;4CCTTTCCAAAGATTTTTTTCTTTCCAATAGAAGTAATC-3" [SEQ. ID. 
No. 26]. The rd29a-reverse primer includes a Hind III ( AAGCTT ) restriction 
site near the 5'-end of primer and rcy29a-fon(vard primer has a Kpnl (GGTACC ) 
restriction site near 5'-end of primer. 

Total genomic DNA may be isolated from Arabidopsis tfialiana (ecotype 
Colombia) by using the CTAB method (Ausubel et al. (1992) Current Protocols 
in Molecular Biology (Greene & Wiley, New York)). Ten nanograms of the 
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genomic DNA can be used as a template in a PGR reaction under conditions 
suggested by the manufacturer (Boehringer Mannheim). The PGR product is 
expected to be 1641 bps and may be PGR amplified and gel purified following 
the same protocol described for the dreb2a promoter. 

(5) rd29b promoter 

A genomic clone carrying the rd29b promoter was identified by using 
Nucleotide Search WWW Entrez at the NGBI with the rd29b as a search word. 
The sequence for the rd29a promoter was located In the region between 
nucleotide positions 90 to 1785 for rd29b (Accession No. D 13044; Table 3). 

Reverse and fonward PGR primers designed to amplify this promoter 
region from Arabidopsis thaliana genomic DNA are: rd29b-reverse primer 5'- 
GGGGAAGCIICATTTTGTGGTAGAGAAGTG-3' [SEQ. ID. No. 27]; and 
rd29jb-forward primer 5'- 

CGGGLACCTTTGCAAAGGTGTGTTTTGTGTTTTTCAAGTG-S' [SEQ. ID. No. 
28]. 

Total genomic DNA may be Isolated from Arabidopsis thaliana (ecotype 
Colombia) by using the GTAB method (Ausubel et al. (1992) Current Protocols 
in Molecular Biology (Greene & Wiley, New York)). Ten nanograms of the 
genomic DNA can be used as a template in a PGR reaction under conditions 
suggested by the manufacturer (Boehringer Mannheim). The PGR product is 
expected to be 1695 bps and may be PGR amplified and gel purified followir.g 
the same protocol described for the dreb2a promoter. 

(6) rab1 8 promoter 

A rad-reiated (responsive to ABA) gene, rab18 from arabidopsis has 
been isolated. The gene encodes a hydrophilic, glycine-rich protein with the 
conserved serine- and lysine-rich domains. The rajt> 7 8 transcripts accumulate 
in plants exposed to water deficit or exogenous abscisic acid (ABA) treatment. 
A weak induction of rab18 mRNA by low temperature was also observed 
(Ishitani etal., Plant Gell 10: 1151-1161 (1998)). 

A genomic DNA containing a promoter region of rabid was identified by 
a BLAST search of Genbank using the upstream untranslated region (757 bps) 
of the rab18 sequence (Accession No. L04173), The sequence of the rab18 
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promoter is located in the region between nucleotide positions 8070 to 9757 
(Accession No. AB013389). 

Reverse and fonward PGR primers designed and used to amplify this 
promoter region from Arabidopsis thaliana genomic DNA are: raftf 8-reverse 
primer 

5'-GCCCA4GCnCAAATTCTGAATATTCACATATCAAAAAAGTG-3' [SEQ. 
ID. No. 29]; and rab1 d-fonNard primer 5'- 

GGAdGdICrGTTCTTCTTGTCTTAAGCAAACACTTTGAGC-3- [SEQ. ID. No. 
30]. The ra678-reverse primer includes a Hind III ( AAGCTT ) restriction site 
near the 5'-end of the primer and rab18-fonvard primer has a Bgi II (AGATC-p 
restriction site near the 5'-end of the primer. 

Total genomic DNA may be isolated from Arabidopsis thaliana (ecotype 
Colombia) by using the CTAB method (Ausubel et al. (1992) Current Protocols 
in Molecular Biology (Greene & Wiley, New York)). Ten nanograms of the 
genomic DNA can be used as a template in a PGR reaction under conditions 
suggested by the manufacturer (Boehringer Mannheim). The PGR product is 
expected to be 1687 bps and may be PGR amplified and gel purified following 
the same protocol described for the dreb2a promoter. 

(7) Cor47 promoter 

The DNA sequence of cDNA for cold-regulated {cor47) gene of 
Arabidopsis thaliana was determined. Gilmour et al.. Plant Molecular Biology 
18: 13-21 (1992)). Expression of cor47 gene was induced by cold stress, 
dehydration and high NaGI treatment (Ishitani et al., Plant Cell. 10: 1151-1161 
(1998)). The promoter region of cor47 gene was identified in Genbank by 
using Nucleotide Search WWW Entrez at the NGBI with the cor47 as a search 
word. The sequence of the cor47 promoter is located in the region between 
nucleotide positions 1-1370 (Accession No. AB004872; Table 3). 

Reverse and fonward PGR primers designed to amplify this promoter 
region from Arabidopsis thaliana genomic DNA are: cor47-reverse primer 

5'-GCCGMGC7TrGGTGTGTTATGATAGAAGGCAGAAAAGGAG-3' [SEQ. ID. 
No. 31]; and cor4 7-fonward primer 5- 

GGA^G^ICIAGTTAATCTTGATTTGATTAAAAGTTTATATAG-3' [SEQ. ID. 
No. 32]. The cor47-reverse primer includes a Hind III (AAGCTT ) restriction 
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Site near the 5'-end of the primer and cor4 7-forward primer has a Bglll 
(AGATCT ) restriction site near the 5'-end of the primer. 

Total genomic DNA may be isolated from Arabidopsis thaJiana (ecotype 
Colombia) by using the CTAB method (Ausubel et al. (1992) Current Protocols 
in Molecular Biology (Greene & Wiley, New York)). Ten nanograms of the 
genomic DNA can be used as a template in a PGR reaction under conditions 
suggested by the manufacturer (Boehringer Mannheim). The PGR product is 
expected to be 1370 bps and may be PGR amplified and gel purified following 
the same protocol described for the drebZa promoter. 

B. Isolation of seed-specific promoters from plant genomic 
DNAs 

The napin promoter from Brassica campestris (genbank accession 
number M64632) is a seed-specific promoter. A fragment of the napin promoter 
(between nucleotides 1 146 to 2148) is identified and isolated by PGR 
amplification using a 5' PGR primer containing a Hindtll site upstream of the 
promoter and a 3' PGR primer containing a BamHI site downstream of the 
promoter. Deletions of the napin promoter to -21 1 and -152 have been shown 
to have reduced levels of expression (Eilerstrom et al. Plant Moi Biol 32:1019- 
27 (1996); Stalberg et al. Planta 199: 515-9 (1996); Stalberg et al. Plant Mol 
Biol 23: 671-83 (1993)). These 5' deleted promoters are useful to have 
reduced levels of CBF1 expression for applications where the larger napin 
promoter fragment is too strong. 

Other seed-active promoters or deletions of these promoters can also 
be isolated from genomic DNA by using the same method described above for 
the napin promoter. Examples of these promoters include but are not limited to 
the soybean 7S seed storage protein (Ghen et al., Developmental Genetics 
10:112-122 (1989), the bean phaseolin promoter (cited in US Patent No. 
5.003,045). the Arabidopsis 12S globulin (cruiferin) promoter (Pang, et al., 
Plant Mol. Biol. 11:805-820 (1988), the maize globulin 1 promoter (Kriz et al. 
Plant Physiol. 91:636 (1989); US Patent No. 5.773,691). These promoters may 
be used for altering GOR gene expression in cereals such as corn, bariey, 
wheat, rice and rye seeds. 
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C. Construction of the plamids containing CBF1 and Inducible 
or tissue-specific promoter 

The expression binary vector pMEN020 contains a kanamycin 
5 resistance gene (neomycin phosphotransferase) for antibiotic selection of the 
transgenic plants and a Spc/Sfr gene used for bacterial or agrobacterial 
selections. The pMEN020 plasmid is digested with restriction enzymes such 
as Hindlll and Bglll to remove the 35S promoter. The 35S promoter is then 
replaced with an inducible promoter. 

10 The expression binary vector pMENOSO is derived from pMEN020 by 

replacing the Nptll kanamycin resistance gene with the Bar gene (US Patent 
5,646, 024), PMENOSO is digested with Hindlll and BamHI restriction enzymes 
to remove the EcaMV 35S promoter. The EcaMV 35S promoter is then 
replaced with the seed-specific napin promoter, resulting plasmid pMENIOOI. 

15 Similarly, the EcaMV 35S promoter is also replaced with the seed-specific 

napin promoters with -21 1 and -152 end point deletions to generate plasmid 

pMEN1002 and pMEN1003, respectively. 

( 1) Cloning of the inducible promoter into pMEN020 

The sequences of the inducible promoters that are PGR amplified and 

20 gel purified, as well as the plasmid pMEN020, are subject to restriction 

digestion with their respective restriction enzymes as listed in Table 4. Both 
DNA samples are purified by using the Qiaquick purification kit (Qiagen, CA) 
and ligated at a ratio of 3:1 (vector to insert). Ligation reactions using T4 DNA 
ligase (New England Biolabs, MA) are carried out at 16°C for 16 hours. The 

25 ligated DNAs are transformed into competent cells of the E. coli strain DH5 
by using the heat shock method. The transformed cells are plated on LB 
plates containing 100 pg/ml spectinomycin (Sigma). Individual colonies are 
grown overnight in five milliliters of LB broth containing 100 pg/ml 
spectinomycin at 37°C. 

30 Plasmid DNAs from transformants are purified by using Qiaquick Mini 

Prep kits (Qiagen, CA) according to the manufacturer's instruction. The 
presence of the promoter insert is verified by restriction mapping with the 
respective restriction enzymes as listed in Table 4 to cut out the cloned insert. 
The plasmid DNA is also subject to double-strand DNA sequencing analysis 
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using a vector primer {E9.1 primer 5'-CAAACTCAGTAGGATTCTGGTGTGT-3' 
[SEQ. ID. No. 33]. 

(2) Cloning of the cbflgene into the piasmids containing the inducible 
promoters 

To clone the CBF1 gene into the piasmids, different PGR primers with 
suitable restriction sites for each plasmid are used to isolate cbf1 gene from 
Arabidopsis thaliana genomic DNA. The primers that may be used are listed in 
Table 5. 



TABLE 5 



Primer name Cloning sites 



DrebZa 


Cbf1 -reversal 


BgNI (AGATCT) 




Cbf1-forward1 


BamHI (GGATCC) 


P5CS 


Cbf1 -reverse 1 


Bglll (AGATCT) 




Cbf1 -forward 1 


BamHI (GGATCC) 


Rd22 


Cbf 1 -reverse2 


Kpnl (GGTACC 




Cbf1 -forward 1 


BamHI (GGATCC) 


Rd29a 


Cbf1-reverse2 


Kpnl (GGTACC 




Cbf 1 -forward 1 


BamHI (GGATCC) 


Rd29b 


Cbf1-reverse2 


Kpnl (GGTACC 




Cbf1 -forward 1 


BamHI (GGATCC) 


Rab18 


Cbf1 -reverse 1 


Bglll (AGATCT) 




Cbf1-forward2 


Xbal (TCTAGA 


Cor47 


Cbf 1 -reverse 1 


Bglll (AGATCT) 




Cbf 1 -forward 1 


BamHI (GGATCC) 



Two of the four available PGR primers (Table 5) are used for cloning 
the at-cbfl gene into the expression vectors containing each inducible 
promoter described above. The four primers have these sequences: cbf1- 
reverse 1 5'-GGAAGATGTTGAAAGAGAGTACTGTGATGAATGAACTG-3' 
[SEQ. ID. No. 34], cbf 1 -fonward 1 5'- 

GGCGGATCGCTCGTTTGTAGAAGAATAAAATAAAATAAAATG-3' [SEQ. ID. 
No. 35], cbf 1 -reverse 2 5'- 
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GGGGTACCTGAAACAGAGTACTCTGATCAATGAACTC.3' [SEQ. ID. No. 36] 
and cbf1 -forward 2 5'- 

GCTCTAGACTCGTn-CTACAACAATAAAATAAAATAAAATG-3' [SEQ ID No 
37]. For example, for the Dreb2a, P5CS. and COR47 promoters that are 
ligated to a BamHI and Bglll flanked insert, the cbfl-reverse 1 and cbf1-fon/vard 
1 primers [SEQ. ID. No. 34 and 35. respectively] are used to isolate obf1 gene 
from Arabidopsis thsliana genomic DNA. The c6ft-reverse primer includes a 
Bglll (AGAICI) restriction site near the S'-end of the primer and cbf Uomart 
pnmer has a BamHI (GGATCC) restriction site near the 5'-end of the primer A 
PGR product of 764 bp is expected. The genomic DNA (10 ng) is used as a 
template in a PGR reaction under conditions suggested by the manufacturer 
(Boehnnger Mannheim). The reaction conditions to be used in this PGR 
experiment are as follows: Segment 1 : 9400. 2 minutes; Segment 2- 9400 30 
seconds; SS^C, 1 minute; 72-0, 1 minute, for a total of 35 cycles; Segment 3- 
72«'C fori 0 minutes. 

The PGR products are subject to electrophoresis in a 0.8% agarose gel 
and visualized by ethidium bromide staining. The DNA fragment containing 
cbf1 IS excised and purified by using a Qiaquick gel extraction kit (Qiagen CA) 
The punfied fragment and the vector pMBI2001 containing the inducible 
promoter (Table 5) are each digested with Bglll and BamHI restriction enzymes 
at 3700 for 2 hours. Both DNA samples are purified by using the Qiaquick 
punfication kit (Qiagen, CA) and ligated at a ratio of 3: 1 (vector to insert) 
Ligation reactions using T4 DNA ligase (New England Biolabs. MA) are carried 
out at I60C for 16 hours. The ligated DNAs are transformed into competent 
cells of the E. coli strain DH5 by using the heat shock method The 
transformation are plated on LB plates containing 100 (g/ml spectinomycin 
(Sigma). 

Individual colonies are grown overnight in five milliliters of LB broth 
containing 100 g/ml spectinomycin at 37°C. Plasmid DNA are purified by using 
Qiaquick Mini Prep kits (Qiagen, CA). The presence of the cbf1 insert is verified 
by restriction mapping with Bglll and BamHI. The plasmid DNA is also subject 
to double-strand DNA sequencing analysis by using vector primer E9 1(5-- 
CAAACTCAGTAGGATTCTGGTGTGT-3') [SEQ. ID. No. 33]. The other 
primers shown in Table 5 and appropriate restriction enzymes are used in a 



84 



wo 99/38977 



PCT/US99/01895 



Similar way to clone the Cbf1 gene into plasmids containing the other inducible 
promoters. The resulting plasmids are listed in Table 6 and shown in Figures 
17A-17G. 

A similar cloning strategy may be used to clone other genes, such as 
cbf2, cbf3, and the other full length CBF genes listed in Table 9 and shown in 
figure 18 (new CBF gene table) into plasmids containing inducible promoters 



Construct name 

PMBI2008 
PMBI2009 
PMBI2010 
PMBI2011 
PMBI2012 
PMBI2013 
PMBI2014 



Table 6. 

Promoter name 

Dreb2a 

P5CS 

Rd22 

Rd29a 

Rd29b 

Rab18 

Cor47 



Figure name 

FIGURE 17A 
FIGURE 17B 
FIGURE 17C 
FIGURE 17D 
FIGURE 17E 
FIGURE 17F 
FIGURE 17G 



(3) Cloning of the cbf1 gene into the plasmids containing seed-specific 
promoters 



Several different CBF coding regions with different translational efficiencies 
in arabidopsis thaliana are cloned into expression vectors pMEN1001 
piVIEN1002. and pMEN1003 to produce different levels of CBF protein in 
transgenic plants. The 5' and 3' PCR primers used to isolate cbf1 gene from 
arabidopsis thaliana genomic DNA are listed below. 

5'-Primer cbf5pri.atg.seq for isolating cbf1. 1 gene [SEQ. ID. No. 96] 
5'-ggaagatctatGAAACAGAGTACTCTGATCAATGAACTC-3' 

5-Primer cbf5pri.wt.seq for isolating cbf1.2 gene [SEQ. ID. No. 97] 
5'-ggaagatctGAAACAGAGTACTCTGATCAATGAACTC-3' 

5--Primer cbf5pri.inframeatg.seq for isolating cbf1.3 gene [SEQ. ID. No 98] 

5'-ggaagatctatGAACAGAGTACTCTGATCAATGAACTC-3' 
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5'-Primer cbf5pri.dbatg.seq for isolating cbf1.4 gene [SEQ. ID. No. 99] 
5'-ggaagatctatGAACAGAGTACTCTGATgCAATGAACTC-3' 

3'-Primer cbf1.long3pri.seq for isolating cbf1.1-4 genes [SEQ. ID. No. 100] 
5'-ggaggatcCTCGTTTCTACAACAATAAAATAAAATAAAATGAAGGAACC- 



The cbf1 gene is cloned into pMENOSO at restriction sites Hindlll and 
BamHI by using a similar strategy as described in subsection (2) of this section 
for cloning of the cbf1 gene into the plasmids containing the inducible 
promoters. The resulting constructs containing cbf1.1-4 genes are 
pMENIOOI.1-4 plasmids, pMEN1002.1-4 plasmids and pMEN1003.1-4 
piasmids, respectively. The presence of the cbf1 gene inserts is verified by 
restriction mapping with Hindlll and BamHI restriction enzymes to cut out the 
cloned insert. The plasmid DNA is also subject to double-strand DNA 
sequencing analysis using a vector primer (E9. 1 primer 5'- 
CAAACTCAGTAGGATTCTGGTGTGT-3' [SEQ. ID. No. 33]. 

D. Transfomnation of Agrobacterium with Plasmids Containing 
CBF1 Gene and Inducible or Tissue-Specific Promoters 

After the plasmid vectors containing cbf1 gene and inducible promoters 
are constructed, these vectors are used to transform Agrobacterium 
tumefaciens cells expressing the gene products. The stock of Agrobacterium 
tumefaciens cells for transfonnation are made as described by Nagel et al. 
FEMS Microbiol Letts 67: 325-328 (1990). Agrobacterium strain ABI is grown 
in 250 ml LB medium (Sigma) overnight at ZB^C with shaking until an 
absorbance (Aeoo) of 0.5 - 1.0 is reached. Cells are harvested by centrifugation 
at 4,000 X g for 1 5 min at 4 "C. Cells are then resuspended in 250 mI chilled 
buffer (1 mM HEPES, pH adjusted to 7.0 with KOH). Cells are centrifuged 
again as described above and resuspended in 125 mI chilled buffer. Cells are 
then centrifuged and resuspended two more times in the same HEPES buffer 
as described above at a volume of 100 pi and 750 pi, respectively. 
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Resuspended cells are then distributed into 40 pi aliquots, quickly frozen in 
liquid nitrogen, and stored at -80*^C. 

Agrobactehum cells are transformed with plasmids formed as described 
above in Section 4B(2) following the protocol described by Nagel et al. FEMS 
5 Microbiol Letts 67: 325-328 (1990). For each DNA construct to be transformed, 
50- 100 ng DNA (generally resuspended in 10 mM Tris-HCI. 1 mM EDTA, pH 
8.0) is mixed with 40 |j| of Agrobactehum cells. The DNA/cell mixture is then 
transferred to a chilled cuvette with a 2mm electrode gap and subject to a 2.5 
kV charge dissipated at 25 pF and 200 pF using a Gene Pulser II apparatus 

10 (Bio-Rad). After electroporation, cells are immediately resuspended in 1.0 ml 
LB and allowed to recover without antibiotic selection for 2 - 4 hours at 28 '^C in 
a shaking incubator. After recovery, cells are plated onto selective medium of 
LB broth containing 100 pg/ml spectinomycin (Sigma) and incubated for 24-48 
h at 28 °C- Single colonies are then picked and inoculated in fresh medium. 

15 The presence of the plasmid constnjct are verified by PCR amplification and 
sequence analysis. 

E. Transfomriation of Arabidopsis Plants with Agrobacterium 
tumefaciens Carrying Expression Vector for CBF1 Protein 

20 

After transformation of Agrobactehum tumefaciens with plasmid vectors 
containing cbf1 gene and inducible promoters, single Agrobacterium colonies 
containing each of pMBI2008 - pMBI2014 are identified, propagated, and used 
to transform Arabidopsis Plants. Briefly, 500 mi cultures of LB medium 

25 containing lOOug/ml spectinomycin are inoculated with the colonies and grown 
at 28 C with shaking for 2 days until an absorbance (Aeoo) of > 2.0 is reached, 
Celts are then harvested by centrifugation at 4,000 x g for 10 min, and 
resuspended in infiltration medium (1/2 X Murashige and Skoog salts (Sigma), 
1 X Gamborg's B-5 vitamins (Sigma), 5.0% (w/v) sucrose (Sigma), 0.044 pM 

30 benzylamino purine (Sigma), 200 \j\/L Silwet L-77 (Lehle Seeds) until an 
absorbance (Asoo) of 0.8 is reached. 

Prior to transformation, Arabidopsis thaliana seeds (ecotype Columbia) 
are sown at a density of -10 plants per 4" pot onto Pro-Mix BX potting medium 
(Hummert International) covered with fiberglass mesh (18 mm X 16 mm). 

35 Plants are grown under continuous illumination (50-75 pE/m^/sec) at 22-23 C 
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With 65-70% relative humidity. After about 4 weeks, primary inflorescence 
stems (bolts) are cut off to encourage growth of multiple secondary bolts. After 
flowering of the mature secondary bolts, plants are prepared for transfonnation 
by removal of all siliques and opened flowers. 

The pots are then immersed upside down in the mixture of 
Agrobacterium / infiltration medium as described above for 30 sec, and placed 
on their sides to allow draining into a r x 2' fiat surface covered with plastic 
wrap. After 24 h, the plastic wrap is removed and pots are turned upright. The 
immersion procedure is repeated one week later, for a total of two immersions 
per pot. Seeds are then collected from each transformation pot and analyzed 
following the protocol described below. 

F. Identification of Arabidopsis Primary Transformants 

Seeds collected from the transformation pots are sterilized essentially 
as follows. Seeds are dispersed into in a solution containing 0.1% (v/v) Triton 
X-100 (Sigma) and sterile HjO and washed by shaking the suspension for 20 
min. The wash solution is then drained and replaced with fresh wash solution 
to wash the seeds for 20 min with shaking. After removal of the second wash 
solution, a solution containing 0.1% (v/v) Triton X-100 and 70% EtOH 
(Equistar) is added to the seeds and the suspension is shaken for 5 min. After 
removal of the ethanol/detergent solution, a solution containing 0.1% (v/v) 
Triton X-IOO and 30% (v/v) bleach (Chlorox) is added to the seeds, and the 
suspension is shaken for 10 min. After removal of the bleach/detergent 
solution, seeds are then washed five times in sterile distilled HjO. The seeds 
are stored in the last wash water at 4 °C for 2 days in the dark before being 
plated onto antibiotic selection medium (1 X Murashige and Skoog salts (pH 
adjusted to 5.7 with 1M KOH), 1 X Gamborg's B-5 vitamins. 0.9% phytagar 
(Life Technologies), and 50 ug/L kanamycin). Seeds are germinated under 
continuous illumination (50-75 pE/m^/sec) at 22-23 °C. After 7-10 days of 
growth under these conditions, kanamycin resistant phmary transformants (Ti 
generation) are visible and are obtained for each of constructs pMBI2008 - 
PIWBI2014. These seedlings are transferred first to fresh selection plates 
where the seedlings continued to grow for 3-5 more days, and then to soil (Pro- 

88 



wo 99/38977 



PCTAJS99/01895 



Mix BX potting medium). Progeny seeds (T2) are collected; kanamycin 
resistant seedlings selected and analyzed as described above. 

G. Transformation of Cereal Plants with Plasmid Vectors 
Containing cbfl Gene and Inducible Promoters 

Cereal plants, such as corn, wheat, rice, sorghum and bariey, can also 
be transformed with the plasmid vectors containing the cbf genes and inducible 
promoters to increase their tolerance to environmental stresses. In these 
cases, the cloning vector, pMEN020, is modified to replace the Nptll coding 
region with the BAR gene of Streptomyces hygroscopicus 
that confers resistance to phosphinothricin. The Kpnl and Bglll sites of the Bar 
gene are removed by site-directed mutagenesis with silent codon changes. 
After cloning of the inducible promoters into the modified plasmid by the same 
procedures described above, the a/-c6f coding region of cbf1 ger^e is inserted 
into the plasmid following the same procedures as described above. The 
resulted plasmids are listed in Table 7. 



Table 7 



Promoter name 


Construct name 


Dreb2a 


PMBI2015 


P5CS 


PMB12016 


Rd22 


PMBI2017 


Kd29a 


PMBI2018 


Rd29b 


PMBI2019 


Rab18 


PMBI2020 


Cor47 


PMBI2021 



It is now routine to produce transgenic plants of most cereal crops 
(Vasil, I., Plant Molec. Biol. 25: 925-937 (1994)) such as com. wheat, rice, 
sorghum (Cassas. A. et al.. Proc. Natl. Acad Sci USA 90: 1 1212-11216 (1993) 
and barley (Wan, Y. and Lemeaux, P. Plant Physiol. 104:37-48 (1994) Other 
direct DNA transfer methods such as the microprojectile gun or Agrobacterium 
fumefec/ens-mediated transformation can be used for corn (Fromm. et al. 
Bio/Technology 8: 833-839 (1990); Gordon-Kamm et al. Plant Cell 2: 603-618 
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(1990); Ishida, Y.. Nature Biotechnology 14:745-750 (1990)), wheat (Vasil, et 
al. Bio/Technology 10:667-674 (1992) ; Vasil etal., Bio/Technology 11:1553- 
1558 (1993); Weeks et al., Plant Physiol. 102:1077-1084 (1993)), rice 
(Christou BiorTechnology 9:957-962 (1991); Hiei et al. Plant J. 6:271-282 
(1994); Aldemita and Hodges. Planta 199:612-617; Hiei et al., Plant Mol Biol. 
35:205-18 (1997)). For most cereal plants, embryogenic cells derived from 
immature scutellum tissues are the preferred cellular targets for transformation 
(Hiei et a!.. Plant Mol Biol. 35:205-18 (1997); Vasil. Plant Molec. Biol. 25: 925- 
937 (1994)). 

Plasmids according to the present invention may be transformed into 
corn embryogenic cells derived from immature scutellar tissue by using 
microprojectile bombardment, with the A188XB73 genotype as the preferred 
genotype (Fromm, et al., Bio/Technology 8: 833-839 (1990); Gordon-Kamm et 
al., Plant Cell 2: 603-618 (1990)). After microprojectile bombardment the 
tissues are selected on phosphinothricin to identify the transgenic embryogenic 
cells (Gordon-Kamm et al., Plant Cell 2: 603-618 (1990)). Transgenic plants 
are regenerated by standard corn regeneration techniques (Fromm. et al., 
Bio/Technology 8: 833-839 (1990); Gordon-Kamm et al., Plant Cell 2: 603-618 

(1990) ). 

The plasmids prepared as described above can also be used to 
produce transgenic wheat and rice plants (Christou, Bio/Technology 9:957-962 

(1991) ; Hiei et al.. Plant J. 6:271-282 (1994) ; Aldemita and Hodges, Planta 
199:612-617 (1996); Hiei et al.. Plant Mol Biol. 35:205-18 (1997)) by following 
standard transformation protocols known to those skilled in the art for rice and 
wheat (Vasil, et al. Bio/Technology 10:667-674 (1992) ; Vasil et al., 
Bio/Technology 11:1553-1558 (1993); Weeks et al., Plant Physiol. 102:1077- 
1084 (1993)). where the BAR gene is used as the selectable marker. 

H. Transformation of Plants with Plasmid Vectors 

Containing cbf1 Gene and Seed-Specific Promoters 

The binary constructs containing seed-specific napin promoters 
(pMENIOOI.1-4; pMEN1002.1-4; and pMEN1003.1-4) are used to transform 
canola and rapeseed plants as described (Moloney et al. US Patent No. 
5,750,871). except that the Bar gene selectable marker is used. These 
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constructs are also used to transform regenerable barley cells by 
microprojectile bombardment (Wan and Lemaux, Plant Physiol. 104: 37-48 
(1994)). After bombardment the tissues are selected on phosphinothricin to 
identify the transgenic embryogenic cells. Transgenic cells are regenerated by 
5 standard barley regeneration techniques (Wan and Lemaux Plant Physiol. 
104: 37-48 (1994)). 

5. Identification of CBF1 Homologs CBF2 and CBF3 Using CBF1 

10 This example describes two homologs of CBF1 from Arabidopsis 

thaliana and named them CBF2 and CBF3. 

CBF2 and CBF3 have been cloned and sequenced as described below. 
The sequences of the DNA and encoded proteins are set forth in SEQ ID NOS: 
12, 13, 14 and 15. Figure 12 shows the DNA sequence for CBF2 encoding 

15 CBF2. Figure 13 shows the DNA sequence for CBF3 encoding CBF3. 

A lambda cDNA library prepared from RNA isolated from Arabidopsis 
thaliana ecotype Columbia (Lin and Thomashow, Plant Physiol. 99: 519-525 
(1992)) was screened for recombinant clones that carried inserts related to the 
CBF1 gene (Stockinger, E. J., et al., Proc Natl Acad Sci USA 94:1035-1040 

20 (1997)). CBF1 was ^^P-radiolabeled by random priming (Sambrook et ah, 

Molecular Cloning. A Laboratory Manual, Ed. 2, Cold Spring Harbor Laboratory 
Press, New York (1989)) and used to screen the library by the plaque-lift 
technique using standard stringent hybridization and wash conditions (Hajela, 
R. K., et al., Plant Physiol 93:1246-1252 (1990); Sambrook et al., Molecular 

25 Cloning. A Laboratory Manual, Ed 2. Cold Spring Harbor laboratory Press, New 
York (1989) 6 X SSPE buffer, 60 °C for hybridization and 0.1 X SSPE buffer 
and 60 ^C for washes). Twelve positively hybridizing clones were obtained and 
the DNA sequences of the cDNA inserts were determined at the MSU-DOE 
Plant Research Laboratory sequencing facility. The results indicated that the 

30 clones fell into three classes. One class carried inserts corresponding to 

CBF1. The two other classes carried sequences corresponding to two different 
homologs of CBF1, designated CBF2 and CBF3. The nucleic acid sequences 
and predicted protein coding sequences for CBF1, CBF2 and CBF3 appear at 
Figure14. 
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A comparison of the nucleic acid sequences of CBF1, CBF2 and CBF3 
indicate that they are 83 to 85% identical as shown in Table 8. Figure 14 
shows the amino acid alignment of proteins CBF1 , CBF2 and CBF3. 

TABLE 8 



Percent identity^ 



DNA^ 



Polypeptide 



cbf1/cbf2 



85 



86 



cbf1/cbf3 



83 



84 



cbf2/cbf3 



84 



85 



Percent identity was determined using the Clustal algorithm from the 
Megalign program (DNASTAR, Inc.). 



10 ^ Comparisons of the nucleic acid sequences of the open reading frames are 
shown. 



Similarly, the amino acid sequences of the three CBF polypeptides 
15 range from 84 to 86% identity. An alignment of the three amino acidic 

sequences reveals that most of the differences in amino acid sequence occur 
in the acidic C-terminal half of the polypeptide. This region of CBF1 serves as 
an activation domain in both yeast and Arabidopsis (not shown). 

Residues 47 to 106 of CBF1 con^espond to the AP2 domain of the 
20 protein, a DNA binding motif that to date, has only been found in plant proteins. 
A comparison of the AP2 domains of CBF1, CBF2 and CBF3 indicates that 
there are a few differences in amino acid sequence. These differences in 
amino acid sequence might have an effect on DNA binding specificity. 



92 



>NSDOCID: <WO 9938977A2 1 > 



wo 99/38977 



PCTAJS99/01895 



6. Activation of Transcription In Y ast 

Containing C-repeat/ORE Using CBF1, CBF2 and CBF3 

This example shows that CBF1, CBF2 and CBF3 activate transcription 
in yeast containing CRT/DREs upstream of a reporter gene. The CBFs were 
expressed in yeast under control of the ADC1 promoter on a 2\j plasmid 
(PDB20.1; Berger. S. L, et at.. Cell 70:251-265 (1992)). Constructs expressing 
the different CBFs were transformed into yeast reporter strains which had the 
indicated CRT/DRE upstream of the lacZ reporter gene. Copy number of the 
CRT/DREs and its orientation relative to the direction of transcription from each 
promoter is indicated by the direction of the an-ow. 

Figure 15 is a graph showing transcription regulation of CRT/DRE 
containing reporter genes by CBF1, CBF2 and CBF3 genes in yeast. In Figure 
15, the vertical lines across the arrows of the COR15a construct represent the 
m3cor15a mutant CRT/DRE construct. Each CRT/DRE-lacZ construct was 
integrated into the URA3 locus of yeast. Error bars represent the standard 
deviation derived from three replicate transformation events with the same CBF 
activator construct into the respective reporter strain. Quantitative B-gal 
assays were performed as described by Rose and Botstein (Rose, M., et al., 
Methods Enzymol. 101:167-180 (1983)). 

7. Homologous CBF Encoding Genes In Other Plants. 

This example shows that homologous sequences to CBF1 are present 
in other plants. The presence of these homologous sequences suggest that 
the same or similar cold regulated environmental stress response regulatory 
elements such as the C-repeat/DRE of Arabidopsis (CCGAC) exist in other 
plants. This example serves to indicate that genes with significant homology to 
CBF1 , CBF2 and CBF3 exist in a wide range of plant species. 

Total plant DNAs from Arabadopsis thaliana, Nicotiana tabacum, 
Lycopersicon pimpinellifolium, Prunis avium, Prunus cerasus, Cucumis 
sativus, and Oryza sativa were isolated according to Stockinger al (Stockinger, 
E. J., et al., J. Heredity, 87:214-218 (1996)). Approximately 2 to 10 pg of each 
DNA sample was restriction digested, transferred to nylon membrane (Micron 
Separations, Westboro, MA) and hybridized according to Walling et al. 
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(Walling, L. L, et al., Nucleic Acids Res. 16:10477-10492 (1988)). 
Hybridization conditions were: 42 °C in 50% formamide, 5X SSC, 20 mM 
phosphate buffer 1X Denhardt's, 10% dextran sulfate, and 10Qpg/ml herring 
sperm DNA. Four low stringency washes at RT in 2X SSC, 0.05% Na sarcosyl 
5 and 0.02% Na4 pyrophosphate were perfonmed prior to high stringency washes 
at 55 °C in 0.2X SSC, 0.05% Na sarcosyl and 0.01% Na4 pyrophosphate. High 
stringency washes were performed until no counts were detected in the 
washout. The Bcll-Bglli fragment of CBF1 (Stockinger et al., Proc Natl Acad 
Sci USA 94:1035-1040 (1997)) was gel isolated (Sambrook et al., Molecular 

10 Cloning. A Laboratory Manual, Ed 2. Cold Spring Harbor Laboratory Press, 
New York (1989)) and direct prime labelled (Feinberg and Vogelstein, Anal. 
Biochem 132; 6-13 (1982)) using the primer MT117 
(TTGGCGGCTACGAATCCC; SEQ ID NO: 16). Specific activity of the 
radiolabelled fragment was approximately 4x10® cpm/^g. Autoradiography 

15 was performed using HYPERFILM-MP (Amersham) at -80 ^'C with one 
intensifying screen for 1 5 hours. 

Autoradiography of the gel showed that DNA sequences from 
Arabadopsis thaliana, Nicotiana tabacum, Lycopersicon pimpinellifolium, 
Prunis avium, Prunus cerasus, Cucumis sativus, and Oryza sativa hybridized 

20 to the labeled Bell, Bglll fragment of CBF1. These results suggest that 

homologous CBF encoding genes are present in a variety of other plants. 

8. identification Of Homologous Sequence To CBF1 In Canola 

25 This example describes the identification of homologous sequences to 

CBF1 in canola using PCR, Degenerate primers were designed for regions of 
AP2 binding domain and outside of the AP2 (carboxyl terminal domain). More 
specifically, the following degenerate PCR primers were used: 

Mol 368 (reverse) 5'- CAY CCN ATH TAY MGN GGN GT -3' 

30 

Mol 378 (fonward) 5'- GGN ARN ARC ATN CCY TCN GCC -3' 
(Y: C/T. N: A/C/G/T, H: A/C/T, M: A/C, R: A/G ) 
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Primer Mol 368 is in the AP2 binding domain of CBF1 (amino acid seq: H P I 
Y R G V) while primer Mo! 378 is outside the AP2 domain (carboxyl terminal 
domain)(amino acid seq: M A E G M L L P). 

The genomic DNA isolated from Brassica Napus was PGR amplified by 
using these primers following these conditions: an initial denaturation step of 2 
min at 93 °C; 35 cycles of 93 for 1 min, 55 °C for 1 min, and 72 for 1 min ; 
and a final incubation of 7 min at 72 at the end of cycling. 

The PGR products were separated by electrophoresis on a 1 .2% 
agarose get and, transferred to nylon membrane and hybridized with the AT 
CBF1 probe prepared from Arabidopsis genomic DNA by PGR amplification. 
The hybridized products were visualized by colormetric detection system 
(Boehringer Mannheim) and the corresponding bands from a similar agarose 
gel were isolated (By Qiagen Extraction Kit). The DNA fragments were ligated 
into the TA clone vector from TOPO TA Gioning Kit (Invitrogen) and 
transformed into E. coli strain TOP10 (Invitrogen). 

Seven colonies were picked and the inserts were sequenced on an ABI 
377 machine from both strands of sense and antisense after plasmid DNA 
isolation. The DNA sequence was edited by sequencer and aligned with the 
AtCBFI by GGG software and NGBI blast searching. 

Figure 16 shows an amino acid sequence of a homolog [GAN1; SEQ. 
ID. No. 17] identified by this process and its alignment to the amino acid 
sequence of GBF1 . The nucleic acid sequence for GAN1 is listed herein as 
SEQ. ID. No, 18. 

As illustrated in Figure 16. the DNA sequence alignment in four regions 
of BN-GBF1 shows 82% identity in the AP2 binding domain region and range 
from 75% to 83% with some alignment gaps due to regions of lesser homology 
or introns in the genomic sequence. The aligned amino acid sequences show 
that the BNGBF1 gene has 88% identity in the AP2 domain region and 85% 
identity outside the AP2 domain when aligned for two insertion sequences that 
are outside the AP2 domain. The extra amino acids in the 2 insertion regions 
are either due to the presence of introns in this region of the BNGBF1 gene, as 
it was derived from genomic DNA, or could be due to extra amino acids in 
these regions of the BNGBF1 gene. Isolation and sequencing of a cDNA of the 
BNGBF1 gene using the genomic DNA as a probe will resolve this. 
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9. Id ntificati n Of Homologous Sequenc To CBF1 
In Canola and other Species 

A PCR strategy similar to that described in Example 8 was used to 
isolate additional CBF homologues from Brassica juncea, Brassica napus. 
Brassica oleracea. Brassica rapa. Glycine max, Raphanus sativus and Zea 
Maize. The nucleotide (e.g. bjCBFI) and peptide sequences (e.g. BJCBF1- 
PEP) of these isolated CBF homologues are shown in Figures 18A and 18B. 
respectively. Table 9 lists the sequence names and sequence ID Nos. of these 
isolated CBF homologues. The PCR primers are internal to the gene so partial 
gene sequences are initially obtained. The full length sequences of some of 
these genes were further isolated by inverse PCR or ligated linker PCR. One 
skilled in the art can use the conserved regions in these genes to design PCR 
primers to isolate additional CBF genes. 



Table 9 



DNA Seq. Name 


Seq. ID No. 


Peptide Seq. Name 


Seq. ID No. 


bjCBFI 


38 


BJCBF1-PEP 


39 


bjCBF2 


40 


BJCBF2-PEP 


41 


bjCBF3 


42 


BJCBF3-PEP 


43 


bjCBF4 


44 


BJCBF4-PEP 


45 


bnCBFI 


46 


BNCBF1-PEP 


47 


bnCBF2 


48 


BNCBF2-PEP 


49 


bnCBF3 


50 


BNCBF3-PEP 


51 


bnCBF4 


52 


BNCBF4-PEP 


53 


bnCBFS 


54 


BNCBF5-PEP 


55 


bnCBF6 


56 


BNCBF6-PEP 


57 


bnCBF7 


58 


BNCBF7-PEP 


59 


bnCBFS 


60 


BNCBF8-PEP 


61 


bnCBF9 


62 


BNCBF9-PEP 


63 


boCBFI 


64 


BOCBF1-PEP 


65 


boCBF2 


66 


BOCBF2-PEP 


67 


boCBF3 


68 


B0CBF3-PEP 


69 
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DNA Seq. Name 


Seq. ID N . 


Peptide Seq. Name 


Seq. ID No. 


boCBF4 


70 


BOCBF4-PEP 


71 


boCBFS 


72 


BOCBF5-PEP 


73 


brCBFI 


74 


BRCBF1-PEP 


75 


brCBF2 


76 


BRCBF2-PEP 


77 


brCBF3 


78 


BRCBF3-PEP 


79 


brCBF4 


80 


BRCBF4-PEP 


81 


brCBFS 


82 


BRCBF5-PEP 


83 


brCBF6 


84 


BRCBF6-PEP 


85 


brCBF7 


86 


BRCBF7-PEP 


87 


gmCBFI 


88 


GMCBF1-PEP 


89 


rsCBFI 


90 


RSCBF1-PEP 


91 


rsCBF2 


92 


RSCBF2-PEP 


93 


zmCBFI 


94 


ZMCBF1-PEP 


95 



Figure 19A shows an amino acid alignment of tine AP2 domains of the 
CBF proteins listed in Table 9 with their consensus sequences highlighted. 
Figure 19A also provides a comparison of the consensus sequence with that of 
the tobacco DNA binding protein EREBP2 (Okme-Takagi. M., et al., The Plant 
Cell 7:173-182 (1995). The sequences of these CBF proteins are BRCBF3- 
PEP [SEQ. ID. No. 79], BRCBF6-PEP [SEQ. ID. No.85], BNCBF5-PEP [SEQ. 
ID. No. 55], ATCBF2-PEP [SEQ. ID. No. 13], ATCBF3-PEP [SEQ. ID. No. 15], 
ATCBF1-PEP [SEQ. ID. No. 2], BNCBF2-PEP [SEQ. ID. No. 49], BNCBF6- 
PEP [SEQ. ID. No. 57], BOCBFS-PEP [SEQ. ID. No. 69], BNCBF3-PEP [SEQ. 
ID. No. 51], BNCBFS-PEP [SEQ. ID. No. 61], BNCBF9-PEP [SEQ. ID. No. 63], 
BRCBF2-PEP [SEQ. ID. No. 77], BOCBF5-PEP [SEQ. ID. No. 73], BOCBF2- 
PEP [SEQ. ID. No. 67], RSCBF2-PEP [SEQ. ID. No. 93], BNCBF4-PEP [SEQ. 
ID. No. 53], BNCBF7-PEP [SEQ. ID. No. 59], BOCBF4-PEP [SEQ. ID. No. 71], 
BRCBF7-PEP [SEQ. ID. No. 87], BRCBF4-PEP [SEQ. ID. No. 81], BRCBF5- 
PEP [SEQ. ID. No. 83], RSCBFI -PEP [SEQ. ID. No. 91], BJCBF2-PEP [SEQ. 
ID. No. 41], BJCBF3-PEP [SEQ. ID. No. 43], BNCBF1-PEP [SEQ. ID. No. 47], 
B0CBF1-PEP [SEQ. ID. No. 65], BRCBF1-PEP [SEQ. ID. No. 75], BJCBF4- 
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PEP [SEQ. ID. No. 45], ZMCBF1-PEP [SEQ. ID. No. 95], and GMCBF1-PEP 
[SEQ. ID. No. 89]. 

As can be seen from the consensus sequence shown in Figure 19A, a 
significant portion of the AP2 domain is conserved among the different CBF 
proteins, in view of this data, Applicants use the conserved sequence in the 
AP2 domain to define a class of AP2 domain proteins comprising this 
conserved sequence. 

Figure 198 shows an amino acid alignment of the AP2 domains shown 
in Figure 1 9A and dreb2a and dreb2b and a consensus sequence between the 
proteins highlighted. As can be seen, a very high degree of homology exists 
between AP2 domains shown in Figure 19A and dreb2a and dreb2b. 
Applicants employ the conserved sequence in the AP2 domain shown in Figure 
19B to define a broader class of AP2 domain proteins which are capable of 
binding to CCG regulatory region. 

Figure 19C shows an amino acid alignment of the AP2 domains shown 
in Figure 19B and tiny and a consensus sequence between the proteins 
highlighted. As can be seen, a very high degree of homology exists between 
AP2 domains shown in Figure 19A, dreb2a, dreb2b and tiny. Applicants 
employ the conserved sequence in the AP2 domain shown in Figure 19C to 
define a yet broader class of AP2 domain proteins which are capable of binding 
to CCG regulatory region. 

Figure 19D shows a consensus sequence corresponding to the 
difference between the consensus sequence shown in Figures 19A and tiny. 
Applicants employ the highlighted portion of the conserved sequence shown in 
Figure 19D to define a group of amino acid residues which may be critical to 
binding to a CCG regulatory region. 

Figure 19E shows a consensus sequence corresponding to the 
difference between the consensus sequence shown in Figures 19B and tiny. 
Applicants employ the highlighted portion of the conserved sequence shown in 
Figure 19E to define another group of amino acid residues which may be 
critical to binding to a CCG regulatory region. 

Figure 20 shows the amino acid alignment of the amino terminus of the 
CBF proteins with their consensus sequence highlighted. The sequences of 
these CBF proteins are: BRCBF3-PEP [SEQ. ID. No. 79], BRCBF6-PEP [SEQ. 

98 



wo 99/38977 



PCT/US99/0189S 



ID. No.85). BNC8F5-PEP [SEQ. ID. No. 55]. ATCBF2-PEP [SEQ. ID. No. 13], 
ATCBF3-PEP [SEQ. ID. No. 15], ATCBF1-PEP [SEQ. ID. No. 2], BNCBF2- 
PEP [SEQ. ID. No. 49], BNCBF6-PEP [SEQ. ID. No. 57], BOCBF3-PEP (SEQ. 
ID. No. 69], BNCBF3-PEP [SEQ. ID. No. 51], BNCBF8-PEP [SEQ. ID. No. 61], 
BNCBF9-PEP [SEQ. ID. No. 63]. BRCBF2-PEP [SEQ. ID. No. 77], BOCBF5- 
PEP [SEQ. ID. No. 73], BOCBF2-PEP [SEQ. ID. No. 67], RSCBF2-PEP [SEQ. 
ID. No. 93], BNCBF4-PEP [SEQ. ID. No. 53 ]. BNCBF7-PEP [SEQ. ID. No. 59], 
BOCBF4-PEP [SEQ. ID. No. 71]. BRCBF7-PEP [SEQ. ID. No. 87], BRCBF4- 
PEP [SEQ. ID. No. 81], BRCBF5-PEP [SEQ. ID. No. 83], and RSCBF1-PEP 
[SEQ. ID. No. 91]. 

As can be seen from the consensus sequence shown in Figure 20. a 
significant portion of the amino terminus of CBF proteins is conserved among 
the different CBF proteins. In view of this data. Applicants employ the 
conserved sequence in the amino terminus domain to define a class of proteins 
comprising this conserved sequence. 

Figure 21A shows the amino acid alignment of the carboxy terminus of 
24 CBF proteins with their consensus sequences highlighted. The sequences 
of these CBF proteins are; BRCBF6-PEP [SEQ. ID. No.85]. BNCBF5-PEP 
[SEQ. ID. No. 55], ATCBF2-PEP [SEQ. ID. No. 13], ATCBF3-PEP [SEQ. ID. 
No. 15], ATCBF1-PEP [SEQ. ID. No. 2], BNCBF2-PEP [SEQ. ID. No. 49], 
BNCBF6-PEP [SEQ. ID. No. 57], BOCBF3-PEP [SEQ. ID. No. 69], BNCBF3- 
PEP [SEQ. ID. No. 51], BNCBF8-PEP [SEQ. ID. No. 61], BNCBF9-PEP [SEQ. 
ID. No. 63], BRCBF2-PEP [SEQ. ID. No. 77], BOCBF5-PEP [SEQ. ID. No. 73], 
RSCBF2-PEP [SEQ. ID. No. 93], BNCBF4-PEP [SEQ. ID. No. 53], BNCBF7- 
PEP [SEQ. ID. No. 59], BOCBF4-PEP [SEQ. ID. No. 71]. BRCBF7-PEP [SEQ. 
ID. No. 87], BRCBF5-PEP [SEQ. ID. No. 83], RSCBF1-PEP [SEQ. ID. No. 91], 
BJCBF2-PEP [SEQ. ID. No. 41], BJCBF3-PEP [SEQ. ID. No. 43], BNCBF1- 
PEP [SEQ. ID. No. 47]. and BOCBF1-PEP [SEQ. ID. No. 65]. 

As can be seen from the consensus sequence shown in Figure 21A, a 
significant portion of the carboxy terminus of CBF proteins is conserved among 
the different CBF proteins. In view of this data, Applicants employ the 
conserved sequence in the carboxy terminus domain to define a class of 
proteins comprising this conserved sequence. 
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Figure 21 B shows the amino acid alignment of the carboxy terminus of 
9 CBF proteins with their consensus sequences highlighted. The sequences of 
these CBF proteins are: BNCBF2-PEP [SEQ. ID. No. 49], BOCBF3-PEP [SEQ. 
ID. No. 69], BNCBF3-PEP [SEQ. ID. No. 51], BNCBF8-PEP [SEQ. ID. No. 61], 
BNCBF9-PEP [SEQ. ID. No. 63], BRCBF2-PEP [SEQ. ID. No. 77]. BOCBF5- 
PEP [SEQ. ID. No. 73], BNCBF1-PEP [SEQ. ID. No. 47], and BNCBF6-PEP 
[SEQ. ID. No. 57]. 

As can be seen from the consensus sequence shown in Figure 2 IB, a 
greater portion of the carboxy terminus is consen/ed when these 9 CBF 
proteins are used. In view of this data, Applicants employ the conserved 
sequence in the carboxy terminus domain to define another class of proteins 
comprising this conserved sequence. 

While the present invention is disclosed by reference to the preferred 
embodiments and examples detailed above, it is to be understood that these 
examples are intended in an illustrative rather than limiting sense, as it is 
contemplated that modifications will readily occur to those skilled in the art. 
which modifications will be within the spirit of the invention and the scope of the 
appended claims. 
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WE CLAIM: 



1 1 . A binding protein other than CBF-1 in isolated form comprising a 

2 consensus sequence sufficiently homologous to the consensus sequence 

3 shown in Figure 19A that the binding protein is capable of binding to a CCG 

4 regulatory sequence. 

1 2, A binding protein other than CBF-1 in isolated form comprising a 

2 consensus sequence sufficiently homologous the consensus sequence shown 

3 in Figure 198 that the binding protein is capable of binding to a CCG regulatory 

4 sequence. 

1 3. A binding protein other than CBF-1 in isolated form comprising a 

2 consensus sequence sufficiently homologous the consensus sequence shown 

3 in Figure 19C that the binding protein is capable of binding to a CCG regulatory 

4 sequence. 

1 4. A binding protein other than CBF-1 in isolated form comprising a 

2 consensus sequence shown in Figure 19A. 

1 5. A binding protein other than CBF-1 in isolated form comprising a 

2 consensus sequence shown in Figure 19B. 

1 6. A binding protein other than CBF-1 in isolated form comprising a 

2 consensus sequence shown in Figure 19C. 

1 7. A binding protein other than CBF-1 in isolated form comprising a 

2 sequence selected from the group consisting of an AP2 domain of SEQ. I.D. 

3 Nos. 13, 15, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61. 63. 65, 67, 69. 71. 

4 73, 75, 77. 79, 81 , 83. 85. 87, 89, 91 . 93, and 95. 

1 8. A non-naturally occurring binding protein comprising an amino acid 

2 sequence capable of binding to a CCG regulatory sequence and an amino acid 

3 sequence which fomns a transcription activation region. 
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1 9. DNA in isolated form comprising a sequence encoding any one of the 

2 binding proteins of claims 1-8. 

1 1 0. A nucleic acid construct capable of transforming a plant comprising a 

2 sequence encoding any one of the binding proteins of claims 1-8. 

1 1 1 . A nucleic acid construct according to claim 1 0, further comprising a 

2 promoter which regulates expression of the binding protein. 

1 12. A nucleic acid construct according to claim 1 1 , wherein the promoter is 

2 a tissue specific promoter. 

1 1 3. A nucleic acid construct according to claim 1 1 , wherein the promoter is 

2 a flower, fruit or seed specific promoter. 

1 14. A binding protein according to any one of claims 1-8 wherein the 

2 binding protein is a recombinant binding protein expressed within a plant. 

1 15. A recombinant cell comprising a recombinant binding protein according 

2 to any one of claims 1-8. 

1 16. A recombinant cell comprising a recombinant sequence encoding a 

2 binding protein according to any one of claims 1-8. 

1 17. A recombinant cell according to claim 16, further comprising a promoter 

2 which regulates expression of the binding protein. 

1 18- A recombinant cell according to claim 16, wherein the promoter is a 

2 tissue specific promoter. 

1 19. A recombinant cell according to claim 16, wherein the promoter is a 

2 flower, fruit or seed specific promoter. 

1 20. Plant material comprising a recombinant binding protein according to 

2 any one of claims 1-8. 
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1 21 . Plant material according to claim 20 wherein the plant material is 

2 selected from th group consisting of seeds, flowers, fruits, leaves and roots of 

3 a plant. 

1 22. Plant material comprising a recombinant sequence encoding a binding 

2 protein according to any one of claims 1-8. 

1 23. Plant material according to claim 22, further comprising a promoter 

2 which regulates expression of the binding protein. 

1 24. Plant material according to claim 22, wherein the promoter is a tissue 

2 specific promoter 

1 25. Plant material according to claim 22, wherein the promoter is a flower, 

2 fruit or seed specific promoter. 

1 26. Plant material according to any one of claims 20-25 wherein the plant 

2 material is a living plant. 

1 27. A composition of matter derived from plant material according to any 

2 one of claims 20-26. 

1 28. A method for altering an environmental stress tolerance of a plant 

2 comprising: 

3 transforming a plant with a sequence encoding a binding protein 

4 according to any one of claims 1-8; 

5 expressing the binding protein within the plant; and 

6 stimulating expression of at least one environmental stress tolerance 
1 gene through binding of the binding protein to a DNA regulatory sequence 

8 comprising CCG which regulates expression of one or more environmental 

9 stress tolerance genes in the plant. 

1 29. A method according to claim 28 wherein the promoter is selected such 

2 that the plant expresses the one or more environmental stress tolerance genes 

3 at a level which differs from a level at which the one or more environmental 

4 stress tolerance genes are expressed in the plant's native state. 
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1 30. A method according to claim 28 wherein the promoter is selected such 

2 that the plant expresses the one or more environmental stress tolerance genes 

3 under environmental conditions at which the plant does not express the binding 

4 protein in the plant's native state. 
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FIGURE ISA 

bjCBPl Species=Brassica juncea LenQth=577 fSFn rn m« ^ai 



bjCBP2 SpeciesaBrassica juncea Lenqth=43i rsEo rn m« 

GAACCAAACAAGAGATCTAGGATCTGGCTCGGTACTTTCCTS^CGCr^^^ 

CGCGCTCACGACGTCGCCGCCATAGCCCTCCGTGGSSTcJSS?iTrTrAl?^^ 

GACTCGGCTTGGCGGCTCCGTATCTCGGAGACAA?SsS?CTSSSA5S5?;SISr? 

???g\^?g°?SSL=§SSS?^^^^^^^^^ 

?i?c?l??tI^cg;?i^§i^^^^^^^ 

GAGCCAAACAAGAAATCTAGGATCTGGCCCGGTACT^TCCrS^TCSSGA^rr^^^ 
CGCGCTCACGACGTCGCCGCCATAGCCCTCCGTGGCAAAScGSTGTPTra^^^ 

°^s:^^°°^^°gc°°ctccgtatcccggagacs?s^5c?^ss?jjjss 

||?1a??Ia^§^^^^^^^ 
||?^J^S??G^?S?S^^^^^^^^ 

bnCBFl Species=Brassica napus LenQth=632 r<5irri rn m« ^ci 

CACCCGATATACCGGGGAGTTCGTCTGAGAAl§fSGGTLG?^GTCTS?GAllinArr 

?^^?a^CAAGAAATCTAGAATTTGGCTTGGAACTTTSASSG??S?A?^^^ 

CGTGCTCACGACGTCGCTGCCCTAGCCCTCCGTGGAAG^^GCGCCTGCOTCaIS^a^^^ 

^^$J^2°P'^°GCGGCTCCGCATCCCGGAGACAACCTGCSSSS2?S?C^^ 

°?I?"?^°«=CGCATTGGCTTTTGAGGCTaAGAAAAGTGATSGAcS?S^ 

cagaacatggaggagacgacggcggtggcttctcaggctgaagtgaatSScga^I^^^ 

GAACATGGCATGAACATGaAGGAGGCAACGGCAGTGGCTTCTSGGj?SS^^ 
^^fp??t^S°^r5^J2SS°^^'=^^^^°°A<=°AGACAATa^^^ 



^^™°''°^°°°^^'^^'=ATGGCGAAGGAGTCGACGGTGGAGGCTC^^ 

gttcgtctgagaaagtcaggtaagtgggtgtgtgS?5?SagSJSccSISJg^^ 
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FIGURE iSA-coDtinued 



AGAATTTGGCCCGGAACTTTCAAAACAGCTGAGATGGCAGCTCGTGCTCACGACGTCGCT 
GCCCTAGCCCTCCGTGGAAGAGGCGCCTGCCTCAATTATGCGGACTCGGCTTGGCGGCTC 
CGCATCCCGGAAACAACCTGCCACAAGGATATCCAGAAGGCTGCTGCTGAAGCCGCATTG 
GCTTTTGAGGCTGAGAAAAGTGATGTGACGATGCAAAATGGCCTGAACATGGAGGAGACG 
ACGGCGGTGGCTTCTCAGGCTGAAGTGAATGACACGACGACAGAACATGGCATGAACATG 
GAGGAGGCAACAGCGGTGGCTTCTCAGGCTGAGGTGAATGACACGACGACAGATCATGGC 
GTAGACATGGAGGAGACGATGGTGGAGGCTGTTTTTACGGAGGAACAAAGTGAAGGGTTC 
AACATGGCGGAGGAGTCGACGGTGGAGGCTGCTGTTGTTACGGATGAACTGAGCAAAGGA 
TTTTACATGGACGAGGAGTGGACGTACGAGATGCCGACCTTGTTGGCTGATATGGCGGCA 
GGGATGCTTTTGCCGCCACCATCTGTACAATGGGGACATAATGATGACTTGGAAGGAGAT 
GCGGACATGAACCTCTGGAGTTATTAAGGATCCGCG i^vjAALK^At,Ar 

baCBF3 SpeciesaBrassica napus Length=884 (SEQ ID No 501 

actacactcagccttatccagtttttttcaaaagatttttcaacaatgaacacattccct 
^^°;!^S^^;E°^t°°'^tggctccgagaacgagtctccggttactacggtagcaggaggt 

GATTATTATCCCATGTTGGCGGCAAGCTGTCCGAAGAAGCCAGCAGGTAGGAAGAAGTTT 

caggagacacgtcaccccatttaccgaggagttcgtctgagaaagtcaggtaagtgggtg 

TGTGAAGTGAGGGAACCAAACAAGAAATCTAGAATTTGGCCCGGAACTTTCAAAACAGCT 

gagatggcagctcgtgctcacgacgtcgctgccctagccctccgtggaagaggcgcctgc 

CTCAATTATGCGGACTCGGCTTGGCGGCTCCGCATCCCGGAGACAACCTGCCACAAGGAT 

atccagaaggctgctgctgaagccgcattggcttttgaggctgagaaaagtgatgtgacg 

ATGCAAAATGGCCTGAACATGGAGOAGACGACGGCGGTGGCTTCTCAGGCTGAAGTGAAT 

gacacgacgacagaacatggcatgaacatggaggaggcaacggcagtggcttctcaggct 

GAGGTGAATGACACGACGACGGATCATGGCGTAGACATGGAGGAGACAATGGTGGAGGCT 

gtttttactggggaacaaagtgaagggtttaacatggcgaaggagtcgacggtggaggct 

GCTGTTGTTACGGAGGAACCGAGCAAAGGATCTTACATGGACGAGGAGTGGATGCTCGAG 
ATGCCGACCTTGTTGGCTGATATGGCGGAAGGGATGCTTTTGCCGCCGCCGTCCGTACAA 
TGGGGACAGAATGATGACTTCGAAGGAGATGCTGACATGAACCT ^v-i^ i av-aa 

bnCBF4 Species=Brassica napus Length=874 [SEQ ID No 52] 

GTAATTCGATTACCGCTCGAGTACTTACTATACTACACTCAGCCTTATCCAGTTTrrCAA 

AAGAAGTTTTCAACTATGAACTCAGTCTCTACTTTTTCTGAACTTCTTGGCTCTGAGAAC 

GAGTCTCCGGTAGGTGGTGATTACTGTCCCATGTTGGCGGCGAGCTGTCCGAAGAAGCCG 

GCGGGTAGGAAGAAGTTTCGGGAGACACGTCACCCCATTTACCGAGGAGTTCGCCTTAGA 

AAATCAGGTAAGTGGGTGTGTGAAGTGAGGGAACCAAACAAAAAATCTAGGATTTGGCTC 

GGAACTTTCAAAACAGCTGAGATCOCAGCTCGTGCTCACGACGTCGCCGCCTTAGCTCTC 

CGTGGAAGAGGCGCCTGCCTCAACTTCGCCGACTCGGCTTGGCGGCTCCGTATCCCGGAG 

ACAACCTGCGCCAAGGATATCCAGAAGGCTGCTGCTGAAGCCGCATTGGCTTTTGAGGCC 

GAGAAGAGTGATACCACGACGAATGATCATGGCATGAACATGGCTTCTCAGGCCGAGGTT 

AATGACACAACGGATCATGGCCTGGACATGGAGGAGACGATGGTGGAGGCTGTTTTTACT 

GAGGAGCAGAGAGACGGGTTTTACATGGCGGAGGAGACGACGGTGGAGGGTGTTGTTCCG 

GAGGAACAGATGAGCAAAGGGTTTTACATGGACGAGGAGTGGATGTTCGGGATGCCGACC 

TTGTTGGCTGATATGGCGGCAGGGATGCTCTTACCGCCGCCGTCCGTACAATGGGGACAT 

AATGATGACTTCGAAGGAGATGTTGACATGAACCTCTGGAATTATTAGTACTCATATTTT 

TTTAAATTATTTTTTGAACGAATAATATTTTATT v^xmuiuaiai i 

bnCBFl Species=Brassica napus Length=898 [SEQ ID No 54] 

AATAAATATCTTATCAAACCAGTCAGAACAGAGATCTTGTTACTTACTATACTACACTCA 

GCCTTATCCAGTTTTCAAAAAAAGTATTCAACGATGAACTCAGTCTCTACTTTTTCTGAA 

CTGCTCCGCTCCGAGAACGAGTCTCCGGTTAATACGGAAGGTGGTGATTACATTTTGGCG 

GCGAGCTGTCCCAAGAAACCTGCTGGTAGGAAGAAGTTTCAGGAGACACGCCACCCCATT 

TACAGAGGAGTTCGTCTGAGGAAGTCAGGTAAGTGGGTGTGTGAAGTGAGGGAACCAAAC 

AAGAAATCTAGAATTTGGCTCGGAACTTTCAAAACAGCTGAGATCGCAGCTCGTGCTCAC 

GACGTTGCCGCCTTAGCTCTCCGTGGAAGAGGCGCCTGCCTCAACTTCGCCGACTCGGCT 

TGGCGGCTCCGTATCCCGGAGACGACCTGCGCCAAGGATATCCAGAAGGCTGCTGCTGAA 

GCCGCATTGGCTTTTGAGGCCGAGAAGAGTGATACCACGACGAATGATCATGGCATGAAC 

^^°°^;£J^^5^*^°'^'r°A°°TTAATGACACGACGaATCATGACCTGGACATGGAGGAGACG 

ATAGTGGAGGCTGTTTTTAGGGAGGAACAGAGAGAAGGGTTTTACATGGCGGAGGAGACG 

*^°°J;$^°^°J°^J°TTCCGGAGGAACAGATGAGCAAAGGGTTTTACATGGACGAGGAG 

TGGATGTTCGGGATGCCGACCTTGTTGGCTGATATGGCGGCAGGGATGCTCTTACCGCTG 

f^2I*^'^°'^^^^'^°°'^°^^^"^^T°ATGACTTCGAAGGAGATGCTGACATGAACCTCTGG 

AATTATTAGTACTCATATTTTTTTAAATTATTTTTTGAACGAATAATATTTTATTGAA 

bnCBFS Species=Brassica napus Length=ll32 [SEQ ID No 56] 

GATTACCGCTCGAGTACTTACTATACTACACTCAGCCTTATCCAGTTTTTC^CAAAAGAT 

TTTTCAACAATGAACACATTCCCTGCTTCCACTGAAATGGTTGGCTCCGAGAACGAGTCT 
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TACACTCCAAATTTAGTTGAAATAcSI?I§S?°SS^J?SISSSJ^^^^ 
bnCBF7 Species=Brassica napus LenQth-76a rcjpn rn c«i 

bnCBFS Species=Brassica napus L.enQth=95i rcipn rn 
bnCBP9 Species»Brassica napus Length=889 fSEo rn Mr^ c-^n 



ATGGCGGAAGGGATGCTTTTGCCGCCGCCGTCCGTACAATGGGGAakGli?S?G£S5 
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GAAGGAGATGCGCACATGAACCTCTGGAGTTATTAAGGATCCGCGAATC 

boCBPl Species=Brassica oleracea Length=563 CSEQ ID No. 64] 

CACCCTATCTACCGGGGAGTTCGCCTGAGAAAGTCAGGTAAGTGGGTGTGTGAAGTGAGG 

GAGCCAAACAAGAAATCTAGGATTTGGCTTGGAACTTTCAAAACCGCAGAGATCGCTGCT 

CGTGCTCACGACGTTGCCGCCTTAGCCCTCCGTGGAAGAGCGGCCTGTCTCAACTTCGCC 

GACTCGGCTTGGCGGCTCCGTATCCCGGAGACAACTTGCGCCAAGGATATCCAGAAGGCT 

GCTGCTGAAGCTGCGTTGGCTTTTGGGGCCGAAAAGAGTGATACCACGACGAATGATCAA 

GGCATGAACATGGAGGAGATGACGGTGGTGGCTTCTCAGGCTGAGGTGAGCGACACGACG 

ACATATCATGGCCTGGACATGGAGGAGACTATGGTGGAGGCTGTTTTTGCTGAGGAACAG 

AGAGAAGGGTTTTACTTGGCGGAGGAGACGACGGTGGAGGGTGTTGTTACGGAGGAACAG 

AGCAAAGGGTTTTATATGGACGAGGAGTGGACGTTCGGGATGCAGTCCTTTTTGGCCGAT 

ATGGCTGAAGGCATGCTCTTTCC 

boCBP2 Species^Brassica oleracea Length=533 [SEQ ID No. 66] 

GAAACATAGATCTTTGTACTTACTATACTTCACCTTATCCAGTTTTATTTTTTTATTTAT 

AAAGAGTTTTCAACAATGACCTCATTTTCTACCTTTTCTGAACTGTTGGGCTCCGAGCAT 

GAGTCTCCGGTTACATTAGGCGAAGAGTATTGTCCGAAGCTGGCCGCAAGCTGTCCGAAG 

AAACCAGCCGGCCGGAAGAAGTTTCGAGAGACGCGTCACCCAGTTTACAGAGGAGTTCGT 

CTGAGAAACTCAGGTAAGTGGGTGTGTGAAGTGAGGGAGCCAAACAAGAAATCTAGGATT 

TGGCTCGGTACTTTCCTAACAGCCGAGATCGCAGCCCGTGCTCACGACGTCGCCGCCATA 

GCCCTCCGCGGCAAATCAGCTTGTCTCAATTTTGCCGACTCCGCTTGGCGGCTCCGTATC 

CCGGAGACAACATGCCCCAAGGAGATTCAGAAGGCGGCTGCTGAAGCCGCGGTGGCTTTT 

AAGGCTGAGATAAATAATACGACGGCGGATCACGGCCTCGACATGGAAGAGAC 

boCBF3 Species-Brassica oleracea Lengths=887 (SEQ ID No. 681 

ACTCAGCCTTATCCAGTTTTTCTCAAAAGATTTTTCAACAATGAACAGATTCCCTGCTTC 

CACTGAAATGGTTGGCTCCGAGAACGAGTCTCCGGTTACTACGGTAGTAGGAGGTGATTA 

TTATCCCATGTTGGCGGCAAGCTGTCCGAAGAAGCCAGCGGGTAGGAAGAAGTTTCAGGA 

GACACGTCACCCCATTTACCGAGGAGTTCGTCTGAGAAAGTCAGGTAAGTGGGTGTGTGA 

AGTGAGGGAACCAAACAAGAAATCTAGAATTTGGCTTGGAACTTTCAAAACAGCTGAGAT 

GGCAGCTCGTGCTCACGACGTGGCTGCCCTAGCCCTCCGTGGAAGAGGCGCCTGCCTCAA 

TTATGCGGACTCGGCTTGGCGGCTCCGCATCCCGGAGACAACCTGCCACAAGGATATCCA 

GAAGGCTGCTGCTGAAGCCGCATTGGCTTTTGAGGCTGAGAAAAGTGATGTGACGATGGA 

GGAGACGATGGCGGTGGCTTCTCAGGCTGAAGTGAATGACACGACGACAGATCATGGCAT 

GAACATGGAGGAGGCAACAGCGGTGGCTTCTCAGGCTGAGGTGAATGACACGACGACAGA 

TCATGGCGTAGACATGGAGGAGACGATGGTGGAGGCTGTTTTTACGGAGGAACAAAGTGA 

AGGGTTCAACATGGCGGAGGAGTCGACGGTGGAGGCTGCTGTTGTTACGGATGAACTGAG 

CAAAGGATTTTACATGGACGAGGAGTGGACGTACGAGATGCCGACCTTGTTGGCTGATAT 

GGCGGCAGGGATGCTTTTGCCGCCACCATCTGTACAATGGGGACATAATGATGACTTGGA 

AGGAGATGCGGACATGAACCTCTGGAGTTATTAATACTCGTATTTTT 

boCBF4 SpeciesaBrassica oleracea Length=950 [SEQ ID No. 70] 

CTGAAAAGAAGATAAAAGAGAGAGAAATAAATATCTTATCAAACCAGACAGAACAGAGAT 

CTTGTTACTTACTATACTACACTCAGCCTTATCCAGTTTTTCAAAAGAAGTTTTCAACTA 

TGAACTCAGTCTCTACTTTTTCTGAACTTCTTGGCTCTGAGAACGAGTCTCCGGTAGGTG 

GTGATTACTGTCCCATGTTGGCGGCGAGCTGTCCGAAGAAGCCGGCGGGTAGGAAGAAGT 

TTCGGGAGACACGTCACCCCATTTACCGAGGAGTTCGCCTTAGAAAATCAGGTAAGTGGG 

TGTGTGAAGTGAGGGAACCAAACAAAAAATCTAGGATTTGGCTCGGAACTTTCAAAACAG 

CTGAGATCGCAGCTCGTGCTCACGACGTCGCCGCCTTAGCTCTCCGTGGAAGAGGCGCCT 

GCCTCAACTTCGCCGACTCGGCTTGGCGGCTCCGTATCCCGGAGACAACCTGCGCCAAGG 

ATATCCAGAAGGCTGCTGCTGAAGCCGCATTGGCTTTTGAGGCCGAGAAGAGTGATACCA 

CGACGAATGATCATGGCATGAACATGGCTTCTCAGGCTGAGGTTAATGACACGACGGATC 

ATGGCCTGGACATGGAGGAGACGATGGTGGAGGCTGTTTTTACTGAGGAGCAGAGAGACG 

GGTTTTACATGGCGGAGGAGACGACGGTGGAGGGTGTTGTTCCGGAGGAACAGATGAGCA 

AAGGGTTTTACATGGACGAGGAGTGGATGTTCGGGATGCCGACCTTGTTGGCTGATATGG 

CGGCAGGGATGCTCTTACCGCCGCCGTCCGTACAATGGGGACATAATGATGACTTCGAAG 

GAGATGCTGACATGAACCTCTGGAATTATTAGTACTCGTAXTTTTTTAAATTATTTTTTG 

AACGAATAATATTTTATTGAATTCGGATTCTACCTGTTTTTTTAATGGAT 

b CBP5 Species=Brassica oleracea Length=877 [SEQ ID No, 72] 

ACCGCTCGAGCAACAATGAACACATTCCCTGCTTCCACTGAAATGGTTAGCTCCGAGAAC 

GAGTCTCCGGTTACTACGGTAGTAGGAGGTGATTATTATCCCATGTTGGCGGCAAGCTGT 

CCGAAGAAGCCAGCGGGTAGGAAGAAGTTTCAGGAGACACGTCACCCCATTTACCGAGGA 

GTTCGTCTGAGAAAGTCAGGTAAGTGGGTGTGTGAAGTGAGGGAACTAAACAAGAAATCT 

AGAATTTGGCTTGGAACTTTCAAAACAGCTGAGATGGCAGCTCGTGCTCACGACGTGGCT 
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fiifJiiiii^lSEIiiL 



CAGCTGA 
TGCCT 
TAT 



GG 
,CTTT 
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FIGURE ISA-continued 

TGACTTCGAAGGAGATGCTGACATGAACCTCTGGA 

brCBFS Species-Brassica rapa Length=a32 [SEQ ID No 821 
ACCGCTCGAGTACTTACTATACTACACTCAGCCTTATCCAGTTTTTCTTCCAACGATGGA 

ctcaatctctacttttcctgaactgcttggctcagagaacgagtctccggSactacgot 

agtaggaggtgattattgtcccaggttggcggcaagctgtccgaagaagcSgcgSS^S 

gaagaagtttcaggagacacgtcaccccatttaccgtggagttcgtttaSSa^g???^ 

taagtgggtgtgtgaagtgagggaaccaaacaagaaatctaSattSg??JSS?ISct 

caaaaccgctgagatcgctgctcgtgctcacgacgttgctgccttagccctcmc^g^ 

aggcgcctgcctcaacttcgccgactcggcttggcggctcSStcccgS^^^ 

cgccaaggatatccagaaggctgctgctgaagctgctttggcttSSggc?5SS^^^^ 

tgatcatggcatgaacatgaagaatactacggcggtggcttctSSi?1aS^^^ 

tacgacgacggaccatggcgtgqacatggaggagacgttggtggaggctgtttttacgga 

n™n™™°°"^^^^^°A<=°<^AGGAGACGAGGGTGGAGGGTGTTGTTA^^ 

ggaacagaacaattggttttacatggacgaggagtggatgtttgggatgccgacgttgtt 
ggttgatatggcggaagggatgcttataccgcggcagtccgtacaatcgggacacScgI 
tgacttcgaaggagatgctgacatgaacctctggaattattagggatccgcg 

brCBFS Species=Brassica rapa Length=83 0 [SEQ ID No 841 

tactacactcagccttatccagttttcaaaaaaagtattcaactatgaactcagtctcta 

ctttttctgaactgctctgctccgagaacaagtctccggttaatacggaagg^ 

acattttggcggcgagctgtcccaagaaacctgctggtaggaagaagtttSSSSS 

gccaccccatttacagaggagttcgcctaagaaagtcaggtaagtggSgtctgaaS??? 

GGGAACCAAACAAGAAATCTAGAATTTGGCTCGGAACTTTCAAAACAGaTGAS^^ 

ctcgtgctcacgacgtcgccgccttagctctccgtggaagaggcgcctgcctcaactSg 

CCGACTCGGCTTGGCGGCTCCGTATCCCAGAGACAACCTGCGCCAAGGATATCCAGAAGG 

ctgctgctgaagccgcattggcttttgaggccgagaagagtgataccacScgStJa^ 
gtggcatgaacatggaggagacgtccgcggtggcttctccggctgagttga^tgaScga 
cggcggatcatggcctggacatggaggagacgatggtggaggctgtttttagggacgSc 
agagagaagggttttacatggcggaggagacgacggtggagggtgttgttccggaggaa^ 

AGATGAGCAAAGGGTTTTACATGGACGAGGAGTGGACGTTCGAGAiGcSlGG?^^ 

ctgatatggcggaagggatgcttctgcctcccccgtccgtacaatggggacataacStg 

ACTTCGAAGGAGATGCTGACATGAACCTCTGGAATTATTAGGGATCCGCG 

brCBP7 Species-Brassica rapa Length=aS4 [SEQ ID No 861 
CTATACTACACACAGCCTTATCCAGCCGCTCGAGTACTTACTATACTACACTCAGCCTTT 

tccagtttttcaaaagaagttttcaacgatgaactcagtctctactctSctga^^^ 

TGGCTCCCAGAACGAGTCTCCCGTAGGTGGTGATTACTGTCCCATGTTGGCGGCGAGCTG 

tccgaagaagccggcgggtaggaagaagtttcgggagacacgtcacccSJttaE^^^ 

agttcgtcttagaaagtcaggtaagtgggtgtgtgaagtgagggaacSaacSgaStc 

taggatttggctcggaactttcaaaacagctgaqatcgcagctcgtgctcacgacgt^ 

^°^cttagctctccgtggaagaggcgcctgcctcaacttcgSSSS§^ 

ccgtatcccggagacaacctgcgccaaggatatccagaaggctgctgctgaagccgcatt 

GGCTTTTGAGGCGGAGAAGAGTGATACCACGACGACGAATGATCATGGCATGAACATGGC 

TTCTCAGGTTGAGGTTAATGACACGACaGATCATGACCTGGASTGSSAScS?^? 

l^°°^?fEI^^?5?t5<=AACAGAGAGAAGGGTTTTACATSa^^^ 

GGAGGGTATTGTTCCGQAGGAACAGATGAGCAAAGGGTTTTACATGGACGAGGAGTGGAT 

GTTCGGGATGCCGACCTTGTTGGCTGATATGGCGGCAGGGATGCTCTTACCGCC^ 

??T^^^??^°°^cataatgatgacttcgaaggagatgctSI^^^^ 

TTAAGGGATCCGCG 

gmcayi Species=Glycine max Length=73 8 [SEQ ID No 381 
CATCCGATTTATAGTGGCGTGAGGAGGAGGAACACGGATAAGTGGGTAAGTGAGGTGAGG 
^^°^^^^^^^GACCAGGATTTGGCTGGGGACTTTTjJScG?aS5SA^^ 
CGGGCCCACGACGTGGCGGCAATGGCCCTGAGGGGCCGGTATGCCTGTCTCAACTTCGCT 

gactcgacgtggcggttaccaattcccgccactgctaacgcaaaSSSSagaa^^ 

gcagcagaggctgccgaggctttcagaccaagtcagaccttagaSatac^^ 

caagagtgtgtaaaagtggtgacgacaacaacgatcacagaacaaaaacgaggaa 

tatacggaggaagaagagcaagtgttagatatgcctgagttgcttaggStatg^^^ 

atgtccccaacacattgcatagggtatgagtatgaagatgctgaStggatgct?!^^ 

°S:E°^°°''°'''==='^^'^°°A°T'^TCTCAATTrAATAACGTGCT?T?GOT 

GTTAGTTTTGGAGTGTGACTGTCTGTACTGGTTTTTTATTAGTAGTACGGATAC^^^ 

TAGGTGGCAGATTGAAAGGGACCAAAAGGAATTTTCTTTTGAAACCC^?TTTGTCA^^ 
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rsCBFl Species= Raphanus sativus Length=793 [SEQ ID No 90l 
ACTACACTCAGCCTTATCCAGTTTTTCTTCCAACGATGGACTCAATCTCTACTTTTTCTG 

aactgcttggctccgagaacgagtctccggttactacggtagtaggaggtgaSattttc 
ccaggttggcggcaagctgtccgaagaagccagcgggtaggaagaagtttcSgacac 
gtcaccccatttaccgcggagttcgtttaagaaagtcaggtaagtgggtgtgtgaaot^^ 

GGGAACCAAACAAGAAATCTAGGATTTGGCrrCGGAACTTTCAAAACCGCTGAaATCGCTO 

ctcgtgctcacgacgttgctgccttagccctccgcggaagaggcgcSScctc^SSg 
ccgactcggcttggcggctccgtatcccggagacaacctgcgccaaSata?cJJ§SSg 

^^°"°"°^°^TGCATTGGCTTTTGAGGCCaAGAAGAGTGATCA?GGS?SS?i?G^ 

f^°^^^^J^f°°^'^T°°C'I^CTCAGGTTGAGGTGAATGACACGACGACGGACCATC^^ 

TGGACATGGAGGAGACGTTGGTGGAGGCTGTTTTTACGGAGGAACAGAGAGAAGGGTTTT 

acatgacggaggagacgagggtggagggtgttgttacggaggaacagaacaat?StS 
acatggacgaggagtggatgtttgggatgccgacgttgttggttgatatggcggaagSI 

raCBP2 Species- Raphanus sativus Length»682 [SEQ ID No 921 

acacctaaaccttatccaggtttaactttttttttcataaagagttttcaacaatgacca 

cattttctaccttttccgaaatgttgggctccgagtacgagtctccggttacattaggcg 

gagagtattgtccgaagctggccgcgagctgtccgaagaaaccagctggtcgtaagSot 

ttcgagagacgcgccacccaatatacagaggagttcgtctgagaaactcagSaagtgg^ 

tgtgtgaagtgagggagccaaacaagaaatctaggatttggctcggtactttcctScS 

ccgagatcgcagcgcgtgcccacgacgtcgccgccatagccctccgcggcaaatccgcat 

gtctcaatttcgctgactcggcttggcggctccgtatcccggagacaacatgcccSSSS 

atatacagaaggcggctgctgaagccgcggtggcttttcaggctgagataaatgatacgS 

cgacggatcatggcctggacttggaggagacgatcgtggaggctatttti^gSaggtS 

acaacgatgagttttatatggacgaggagtccatgttcgggatgccgtctttgttggcta 

GTATGGCGGAAGGGATGCTTTTGCCGCTGCCGTCCGTACAATCTGAACATAACTGTGACT 
TCGACGGAGATGCTGACATGAA v.««.v.Ax-rtAi_ 1 i OAt. J. 

zmCBFl Species=Zea maize Length=349 [SEQ ID No 94] 

CGGAGTCCGCGGACGGCGGCGGCGGCGGCGACGACGAGTACGCGACGGTGCTGTCGGCGC 

CACCCAAGCGGCCGGCGGGGCGGACCAAGTTCCGGGAGACGCGGCACCCCGTGTACCGCG 

GCGTGCGGCGGCGCGGGCCCGCGGGGCGCTGGGTGTGCGAGGTCCGCGAGCCCAACAAGA 

^^^^°S°?^T^'^^°<=TCGGCACCTTCGCCACCCCCGAGGCCGCCGCGCGCGCGCACGACG 

TGGCCGCGCTGGCCCTGCGGGGCCGCGCCGCGTGCCTCAACTTCGCCGACTCGGCGCGCC 

TGCTCCAAGTCGACCCCGCCACGCTCGCCACCCCCGACGACATCCGCCG 
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FIGURE 18B 



BJCBFl-PBP Species=Brassica juncea length»130 fSEO ID No 391 
BCrCBP2-PEP Species=Brassica juncea lenath-l43 fqpn rn m^^ ^^i 



BJCBF3-PEP Species=Brassica juncea lenath-143 fSEn rn Mr^ 



BJCBF4-PEP Species=Brassica juncea lenath=i42 rsEO rn Mr> 

BNCBF2-PEP Species«Brassica napus lenath-29T r^Tm rn aoi 
MNTFPASTEMVGSENESPVTTWGGDYYPME^a|^AaR?L-QE??HP?Y?GVllill 

sgkwvcevrepnkksriwpgtfktaemaarahdvaaiSrgr^SSJSsSl^^ 
tctkdiqkaaaeaaij^eaeksdvtmqngi^eett/^qSvS^ 

VASQAEVNDTTTDHGVDMEETMVEAVFTEEQSEGFNN^STVES?J?SSSS??Si 
EWTYEMPTLLADMAAGMLLPPPSVQWGHNDDLEGDAD^WSY 

BMCaF3-PBP Species»Brassica napus lenqth=279 fSEO td ktq =;t 1 

rWrFPASTEMVGSENESPVTWAGGDYYPMLAASCP^PAGRKKFQET?^^ 

SGKWCEVREPNKKSRIWPGTFKTAEMAARAHDVAASLRaRS^SSsS^pS 

TCHroiQKAAAEAAIAFEAEKSDVTMQNGUmEET^T^QAES 

VASQAEVNDTTTDHGVDMEETMVEAVFTGEQSEGFN^^STV^ 

EWMLEMPTLLADMAEGMLLPPPSVQWGQNDDFEGDAD^^^^^ 

BNCBF5-PaP Specie3=Brassica napus lenqth=25l fSEO rn mo "!>;i 
^WSVSTFSELLRSENESPVln:EGGDYIIJ^CPKKlAGRKKF6ETI^p?re 

wvcevrepnkksriwlgtfktaeiaarahdvaalalrgr^cSfSsISS^^ 

KDIQKAAAEAAIJ^EAEKSDTTTNDHGM^mASQ^;E^^TTD 
EGDSS^Sr^'^^'^^^"^^^™^^^""^^"^^^^^^ 

BNCBP6-PBP Species^Brassica napus lenQth=277 fSEO rn mq ^71 
JSSS^^^^^;^^ftfr^2^!^'^^'^^^^^TTTDHG^lNMEEATAVASQAE 



BMCBF7-PBP Species^Brassica napus lenQth=213 fqpn rn Mr. =:qi 
ElQKAAAEAAVAFKAEINNTTADHGIDVEETIVEAIFTEENi;DGP?SSEsS?S2 
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FIGURE 1 SB-continued 

ASMAEGMLLPPPSVQFGHTYBFDGDADVSLWSY 



TCHKDIQKAAAEAAIAFEAEKSD^ri^STMA^^^^rS™^^ 



'SKoOJSlclGJSiSS^^tf,'"' "^^ :.e=gth.X24 tSEO ID No. 751 
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FIGURE 1 SB-continued 



BRCBP4-PEP Species=Brassica rapa Length=112 [SEQ ID No. ai] 

MDSISTFPELLGSENESPVTTWGGDYCPRLAASCPKKPAGRKKFQETRHPIYRGVRLRK 

SGKWVCEWEPNKJCSRIWLGTFKTAEIAARAHDVAAIiALRGRGACLNFADSA 

BRCBP5-PEP SpeciessBrassica rapa Length=2S5 [SEQ ID No. 83] 

MDSISTFPELLGSENESPVTTWGGDyCPRLAASCPKKPAGRKKFQETRHPIYRGVRLRK 

SGKKV^CEVREPNKKSRIWLGTFKTAEIAARAHDVAALALRGRGACLNFADSAWRLRIPET 

TCAKIDIQKAAAEAALAFEAEKSDHG^INMKNTTAVASQVEVNDTTTDHGVDMEETLVEA 

TEEQREGFYMTEETRVEGWTEEQNNWFYMDEEWMFGMPTLLVDMAEGMLIPRQSVQSGH 

YDDFEGDADMNLWNY 

BRCBF6-PBP Species=Brassica rapa Length=258 [SEQ ID No. 85] 

MNSVSTFSELLCSENKSPVNTEGGDyiLAASCPKKPAGRKKFQETRHPIYRGVRLRKSGK 

WCEVREPNKKSRIWLGTFKTAEIAARAHDVAALALRGRGACLNFADSAWRLRIPETTCA 

KDIQKAAAEAA1AFEAEKSDTTTNDRGMNMEETSAVASPAELNDTTADHGLDMEETM\^ 

VFRDEQREGFYMAEETTVEGVVPEEQMSKGFYMDEEWTFEMPRLXiADMAEGMLLPPPSVQ 

WGHNDDFEGDADMNLWNY 

BRCBP7-PEP Species=Brassica rapa Length=251 [SEQ ID No. 87) 

MNSVSTLSEVLGSQNESPVGGDYCPMLAASCPKKPAGRKKFRETRHPIYRGVRLRKSGKW 

VCEVREPNKKSRIWLGTF KTAEIA AJlAHDVAAIiALRGRGACLNFADSAWRLRIPETTCAK 

DIQKAAAEAAIAFEAEKSDTTTTNDHGMNMASQVEVNDTTDHDLDMEET^^^ 

EGFYMAEETTVEGIVPEEQMSKGFYMDEEWMFGMPTLLADMAAGMLLPPPSVQWGHNDDF 

EGDADMNLWKY 

GMCSPl-PEP SpeciessGlycine max t.ength=170 [SEQ ID No. 89] 
HPIYSGVRRRNTDKV^SEVREPNKKTRIWLGTFPTPEMAARAHDVAAMALRGRYACLNFA 
DSTWRLPIPATANAKDIQKAAAEAAEATRPSQTLENTNTKQECVKVVTTTTITEQKRGMF 
YTEEEEQVLDMPELLRNMVLMSPTHCIGYEYEDADLDAQDAEVSLWSFSX 

RSCBFl-PBP SpeciessRaphanus sativus Length=252 [SEQ ID No. 91] 
MDSISTFSELLGSENESPVTTWGGDYFPRIJSlASCPKKPAGRKKFQETRHPIYRGVRLRK 

sgkwvcevhepnkksriwlgtfktaeiaarahdvaalalrgrgaclnfadsawrlripet 
tcakdiqkaaaeaalafeaeksdhgmnmknttavasqvevndtttdhgvdmeetlveavf 
teeqregfymteetrvegvvteeqnnwfymdeewmfgmptllvdmaegmllprpsvqsgh 

YDDFEGDADMNL 

RSCBF2-PEP Species aRaphanus sativus Lengths=2Q9 [SEQ ID No. 93] 

MTTFSTFSEMLGSEYESPVTLGGEYCPKIiAASCPKKPAGRKKFRETRHPIYRGVRLRNSG 

KWVCEVREPNKKSRIWLGTFLTAEIAARAHDVAAIALRGKSACLNFADSAWRLRIPETTC 

PKDIQKAAAEAAVAPQAEINDTTTDHGLDLEETIVEAIFTEVNNDEFYMDEESMFGMPSL 

LASMAEGMLLPLPSVQSEHNCDFDGDADM 

2MCBP1-PBP Species=Zea maize Length=115 [SEQ ID No. 95) 

ESADGGGGGDDEYATVLSAPPKRPAGRTKFRETRHPVYRGVRRRGPAGRWVCEVREPNKK 

SRIWLGTFATPEAAARAHDVAALALRGRAACLNFADSARLLQVDPATLATPDDIR 
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c{bnCBF6C} qsegfnmaee stveaawtd elskgfYMd. eEwtyeMptl ladmAaGMLl 

c{boCBF3C} qsegfnmaee stveaawtd elskgfYMd. eEwtyeMptl ladmAaGMLl 

c{bnCBF2C} qsegfnmaee stveaawtd elskgfYMd. eEwtyeMptl ladmAaGMLl 

c{boCBF5C} qsegfnmake staeaawte elskgvYMd. eEwtyeMptl ladmAaGMLl 

c{brCBF5C} qregfymtee trvegwtee q.nnwfYMd. eEwmfgMptl IvdmAeGMLi 

cjrsCBFlC} qregfymtee trvegwtee q.nnwfYMd. eEwmfgMptl IvdmAeGMLi 

c{bnCBF4C} qrdgfymaee ttvegwpee qmskgfYMd. eEwmfgMptl ladmAaGMLl 

c{boCBF4C} qrdgfymaee ttvegwpee qmskgfYMd. eEwmfgMptl ladmAaGMLl 

c{bnCBF5C} qregfymaee ttwgwpee qmskgfYMd. eEwmfgMptl ladmAaGMLl 

c{brCBF7C} qregfymaee ttvegivpee qmskgfYMd. eEwmfgMptl ladmAaGMLl 

c{brCBF6C} qregfymaee ttvegwpee qmskgfYMd. eEwtfeMprl ladmAeGMLl 

cjboCBFlC} qregfylaee ttvegwtee q.skgfYMd. eEwtfgMqsf ladmAeGMLf 

c{bjCBF2C} esse gfYMd. eEfmfgMptl wasmAeGMLl 

c{bjCBF3C} esse gfYMa. eEfmfgMptl wasvAeGMLl 

c{bnCBF7C} ennd gfYMde eEsmfgMpal lasmAeGMLl 

c(rsCBF2C} vnnd efYMd. eEsmfgMpsi lasmAeGMLl 

cjatCBFlC} qseg afYMd. eEtmfgMptl IdnmAeGMLl 

c{atCBF2C} qsqd, . , .afYMd, eEamlgMssl IdnmAeGMLl 

c{atCBF3C} qsen afYMh. dEamfeMpsl lanmAeGMLl 

Consensus YM-- -B M A-GML- 
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FIGURE 21B 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 



(i) APPLICANT: Michael Thomashow, Eric Stockinger, Kirsten 
Jaglo-Ottosen, Sarah Gilmour, Daniel Zarka and Cai-2hong Jiang 

^^^^^ TITLE OF INVENTION: PLANT HAVING ALTERED ENVIRONMENTAL 
STRESS TOLERANCE wiNiir^iN i^i. 



(iii) NUMBER OF SEQUENCES: 100 

(A) ADDRESSEE: David J. Weitz, 

Wilson Sonsini Goodrich & Rosati 

(B) STREET: 650 Page Mill Road 

(C) CITY: Palo Alto 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94304-1050 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch diskette 

(B) COMPUTER: IBM compatible 

(C) OPERATING SYSTEM: Microsoft Windows 95 

(D) SOFTWARE: MS WORD 97, 

ASCII (DOS) TEXT format 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/018,233 

(B) FILING DATE: February 3, 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/017,816 

(B) FILING DATE: February 3, 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/018,235 

(B) FILING DATE: February 3, 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/017,575 

(B) FILING DATE: February 3, 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/018,227 

(B) FILING DATE: February 3, 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/018,234 

(B) FILING DATE: February 3, 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/ 198,119 

(B) FILING DATE: November 23, 1998 

(viii) ATTORNEY/AGENT INFORMATION: 
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(A) NAME: David J. Weitz 

(B) REGISTRATION NUMBER: 38,362 

(C) REFERENCE/DOCKET NUMBER: 19117.714 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (650) 493-9300 

(B) TELEFAX: (650) 493-6811 

(C) TELEX: None 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 



(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: CBFl gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



AAAAAGAATC 


TACCTGAAAA 


GAAAAAAAAG 


AGAGAGAGAT 


ATAAATAGCT 


50 


TACCAAGACA 


GATATACTAT 


CTTTTATTAA 


TCCAAAAAGA 


CTGAGAACTC 


100 


TAGTAACTAC 


GTACTACTTA 


AACCTTATCC 


AGTTTCTTGA 


AACAGAGTAC 


150 


TCTGATCAAT 


GAACTCATTT 


TCAGCTTTTT 


CTGAAATGTT 


TGGCTCCGAT 


200 


TACGAGCCTC 


AAGGCGGAGA 


TTATTGTCCG 


ACGTTGGCCA 


CGAGTTGTCC 


250 


GAAGAAACCG 


GCGGGCCGTA 


AGAAGTTTCG 


TGAGACTCGT 


CACCCAATTT 


300 
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AACTCCGGTA 


AGTGGGTTTC 


TGAAGTGAGA 


350 


GAGCCAAACA 




oHi i i (jLjC 1 C 


GGGACTTTCC 


AAACCGCTGA 


400 


GATGGPAGrT 




AL-(j> 1 C^bC i GC 


ATTAGCCCTC 


CGTGGCCGAT 


450 


CAGCATGTCT 




b/\U i bbbC IT 


GGCGGCTACG 


AATCCCGGAG 


500 


TCAACATGCG 


CCAAGnAT AT 


^ ^ AAAAA L- o 


GCTGCTGAAG 


CGGCGTTGGC 


550 


T T T T C AAG AT 


GAGACrJTGTG 




CACGGATCAT 


GGCCTGGACA 


600 


T G GAG GAG AC 






CACCGGAACA 


GAGCGAAGGT 


650 


GrGTTTT'ATA 




bACAAlbi IT 


GGGATGCCGA 


CTTTGTTGGA 


700 






TTTTACCGCC 


GCCGTCTGTT 


CAATGGAATC 


750 


ATAATTATHA 




bAi bbTGACG 


TGTCGCTTTG 


GAGTTACTAA 


800 


TATTCGATAG 


TCGTTTCCAT 


TTTTGTACTA 


TAGTTTGAAA 


ATATTCTAGT 


850 


TCCTTTTTTA 


GAATGGTTCC 


TTCATTTTAT 


TTTATTTTAT 


TGTTGTAGAA 


900 


AC GAG 










905 


(2) INFORMATION FOR SEQ ID NO : 2 : 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 

(B) TYPE: TVmino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 



{v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: CBFl protein 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 
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(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 



Phe 


Ser 

5 


Ala 


Phe 


Ser 


Glu 


Met 


Phe 


Gly 


o C I. 


Asp 


Tyr 


Gly 










10 








T CI 

1 D 


Gly 


Asp 


Tyr 


Cys 


Pro 


Thr 


Leu 


Ala 


Thr 


Ser 


y o 


Pro 


20 










25 












Ala 


Gly 


Arg 


Lvs 


Lys 


Phe 


Arg 


Glu 




Arg 


nx s 




35 










4 0 








A c: 


Arg 


Gly 


Val 


Arg 


Gin 


Ara 


Asn 


Ser 


Gly 


Lys 


i rp 


V d X 


Arg 


50 










55 






bU 


Glu 


Pro 


Asn 


Lvs 


Lys 


Thr 


Arg 


lie 


i rp 


Leu 


ox y 




65 










70 










Thr 


Ala 


Glu 


Met 


Ala 


Ala 


Arg 


Ala 


Hi ^ 

O J. O 




V a J, 


Hxa 


Leu 


80 










85 








y u 


Arg 


Gly 


Arg 


Ser 


Ala 


Cys 


Leu 


Asn 


Phe 


Ala 


Asp 


Arg 


95 










100 










1 n c 
1 U D 


Leu 


Arg 


lie 


Pro 


Glu 


Ser 


Thr 


Cys 


/T.X d 


Lys 


Asp 


Ala 


110 










115 










Ala 


Ala 


Glu 


Ala 


Ala 


Leu 


Ala 


Phe 


Gin 


Asp 


Glu 


Thr 


125 










130 








135 


Thr 


Thr 


Thr 


Asp 


His 


Gly 


Leu 


Asp 


Met 


Glu 


Glu 


Glu 


140 










145 








150 


Ala 


lie 


Tyr 


Thr 


Pro 


Glu 


Gin 


Ser 


Glu 


Gly 


Ala 


Asp 


155 










160 










165 


Glu 


Glu 


Thr 


Met 


Phe 


Gly 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


170 










175 










180 


Glu 
185 


Gly 


Met 


Leu 


Leu 


Pro 
190 


Pro 


Pro 


Ser 


Val 


Gin 
195 


Asn 


Tyr 


Asp 


Gly 


Glu 


Gly 


Asp 


Gly 


Asp 


Val 


Ser 


Leu 




200 










205 






210 



Trp Ser Tyr 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N/A - Synthetic 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 
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(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Table 1 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 3 : 
GATCATTTCA TGGCCGACCT GCTTTTT 2 7 



(3) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 



(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N/A - Synthetic 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Table 1 

(x) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CACAATTTCA AGAATTCACT GCTTTTTT 2 8 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 
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(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
CD) TOPOLOGY: Linear 



(ii) MOLECULE TYPE; DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N/A - Synthetic 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

( I ) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Table 1 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 5 : 
GATCATTTCA TGGTATGTCT GCTTTTT 2 7 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 



(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N/A - Synthetic 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 
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(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Table 1 

(X) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GATCATTTCA TGGAATCACT GCTTTTT 



(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 



(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N/A - Synthetic 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Table 1 

(x) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GATCACTTGA TGGCCGACCT CTTTTTT 
(2) INFORMATION FOR SEQ ID NO : 8 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N/A - Synthetic 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Table 1 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GATCAATATA CTACCGACAT GAGTTCT 27 



(2) INFORMATION FOR SEQ ID NO : 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 



( iii ) HYPOTHETICAL : No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: N/A - Synthetic 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 
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(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

( I ) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Table 1 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
ACTACCGACA TGAGTTCCAA AAAGC 25 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Amino Acid 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis txhaiiana 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 
CD) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Figure 2D 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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lie Tyr Arg Gly Val Arg Gin Arg Asn Ser Gly Lys Trp Val Ser 

5 10 15 

Glu Val Arg Glu Pro Asn Lys Lys Thr Arg He Trp Leu Gly Thr 

25 30 

Phe Gin Thr Ala Glu Met Ala Ala Arg Ala His Asp Val Ala Ala 

35 40 45 

Leu Ala Leu Arg Gly Arg Ser Ala Cys Leu Asn Phe Ala Asp Ser 

^0 55 60 

(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 61 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 
CD) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Polypeptide 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tobacco 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

CC) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: Figure 2D 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

His Tyr Arg Gly Val Arg Gin Arg Pro Trp Gly Lys Phe Ala Ala 
5 10 15 

Glu He Arg Asp Pro Ala Lys Asn Gly Ala Arg Val Trp Leu Gly 

Thr Tyr Glu Thr Ala Glu Glu Ala Ala Leu Ala Tyr Asp Lys Ala 
35 40 45 

Ala Tyr Arg Met Arg Gly Ser Lys Ala Leu Leu Asn Phe Pro His 

Arg 



(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 

(B) TYPE: Nucleic Acid 
fC) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE- 

('bJ = ^"^^^°P3i3 thaliana 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE- N/A 

(E) HAPLOTYPE: N/P 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 
(vixi) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: CBF2 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: quencing 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATGAACTCAT TTTCTGCCTT TTCTGAAATG TTTGGCTCCG ATTACGAGTC 
TCCGGTTTCC TCAGGCGGTG ATTACAGTCC GAAGCTTGCC ACGAGCTGCC 
CCAAGAAACC AGCGGGAAGG AAGAAGTTTC GTGAGACTCG TCACCCAATT 
TACAGAGGAG TTCGTCAAAG AAACTCCGGT AAGTGGGTGT GTGAGTTGAG 
AGAGCCAAAC AAGAAAACGA GGATTTGGCT CGGGACTTTC CAAACCGCTG 
AGATGGCAGC TCGTGCTCAC GACGTCGCCG CCATAGCTCT CCGTGGCAGA 
TCTGCCTGTC TCAATTTCGC TGACTCGGCT TGGCGGCTAC GAATCCCGGA 
ATCAACCTGT GCCAAGGAAA TCCAAAAGGC GGCGGCTGAA GCCGCGTTGA 
ATTTTCAAGA TGAGATGTGT CATATGACGA CGGATGCTCA TGGTCTTGAC 
ATGGAGGAGA CCTTGGTGGA GGCTATTTAT ACGCCGGAAC AGAGCCAAGA 
TGCGTTTTAT ATGGATGAAG AGGCGATGTT GGGGATGTCT AGTTTGTTGG 
ATAACATGGC CGAAGGGATG CTTTTACCGT CGCCGTCGGT TCAATGGAAC 
TATAATTTTG ATGTCGAGGG AGATGATGAC GTGTCCTTAT GGAGCTATTA 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 

650 
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651 



(2) INF0R^4ATI0N FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Polypeptide 
(111) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE- 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE- N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE; N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

{vii) IMMEDIATE SOURCE: N/A 
Cviii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: CBF2 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
Met Asn ser Phe Ser Ala Phe Ser Glu Met Phe Gly Ser Asp Tyr 
Glu ser Pro Val Ser Ser Gly Gly Asp Pyr Ser Pro Lys Leu 
Thr ser Cys Pro Lys Lys Pro Ala Gly Arg Lys Lys Phe Arg Glu 
Thr Arg His Pro lie Tyr Arg Gly Val A^g Gin Arg Asn Ser cly 
Lys Trp val cys Glu Leu Arg Glu Pro Asn Lys Lys Thr Arg iS 
Trp Leu Gly Thr Phe Gin Thr Ala Glu ult Ala Ala Arg Ala hIs 
ASP val Ala Ala lie Ala Leu Arg Gly Arg Ser Ala Cys Leu As'n 
Phe Ala ASP Ser Ala Trp Arg Leu Arg HI Pro Glu Ser Thr Cys' 
Ala Lys Glu He Gin Lys Ala Ala Ala gIu Ala Ala Leu Asn Phe 
Gin Asp Glu Met Cys His Met Thr Thr Asp Ala His Gly Leu isp 



150 
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Met 


Glu 


Glu 


Thr 


Leu 


Val 


Glu 


Ala 


lie 


Tyr 


Thr 


Pro 


Gin 








155 










160 






Asp 


Ala 


Phe 


Tyr 


Met 


Asp 


Glu 


Glu 


Ala 


Met 


Leu 










170 










175 






Ser 


Leu 


Leu 


Asp 


Asn 


Met 


Ala 


Glu 


Giy 


Met 


Leu 


Leu 










185 








190 




Ser 


Val 


Gin 


Trp 


Asn 


Tyr 


Asn 


Phe 


Asp 


Val 


Glu 


Gly 










200 








205 




Val 


Ser 


Leu 


Trp 


Ser 


Tyr 















165 
Ser 
180 
Pro 
195 
Asp 
210 



215 

(2) INFORMATION FOR SEQ ID NO : 1 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 



(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL; 



(B) 
(C) 
(D) 

DNA 

No 



651 

TYPE: Nucleic Acid 
STRANDEDNESS : Single 
TOPOLOGY: Linear 



(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 
(1} ORGANELLE; N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) 



FEATURE: 



(A) NAME/KEY: CBF3 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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(2; INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 216 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Polypeptide 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE 

(A) NAME/KEY 

(B) LOCATION 



CBF3 
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150 
200 



ATGAACTCAT TTTCTGCTTT TTCTGAAATG TTTGGCTCCG ATTACGAGTC 5 0 

TTCGGTTTCC TCAGGCGGTG ATTATATTCC GACGCTTGCG AGCAGCTGCC 100 
CCAAGAAACC GGCGGGTCGT AAGAAGTTTC GTGAGACTCG TCACCCAATA 
TACAGAGGAG TTCGTCGGAG AAACTCCGGT AAGTGGGTTT GTGAGGTTAG 
AGAACCAAAC AAGAAAACAA GGATTTGGCT CGGAACATTT CAAACCGCTG 2 50 

AGATGGCAGC TCGAGCTCAC GACGTTGCCG CTTTAGCCCT TCGTGGCCGA 300 
TCAGCCTGTC TCAATTTCGC TGACTCGGCT TGGAGACTCC GAATCCCGGA 
ATCAACTTGC GCTAAGGACA TCCAAAAGGC GGCGGCTGAA GCTGCGTTGG 
CGTTTCAGGA TGAGATGTGT GATGCGACGA CGGATCATGG CTTCGACATG 4 50 

GAGGAGACGT TGGTGGAGGC TATTTACACG GCGGAACAGA GCGAAAATGC 500 
GTTTTATATG CACGATGAGG CGATGTTTGA GATGCCGAGT TTGTTGGCTA 
ATATGGCAGA AGGGATGCTT TTGCCGCTTC CGTCCGTACA GTGGAATCAT 
AATCATGAAG TCGACGGCGA TGATGACGAC GTATCGTTAT GGAGTTATTA 
A 



350 
400 



550 
600 
650 
651 
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(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

iSn Ser Phe Ser 
5 

er Ser Val Ser 
20 

er Cys Pro Lys 
35 

rg His Pro lie 
50 

rp Val Cys Glu 
65 

eu Gly Thr Phe 
80 

al Als Ala Leu 
95 

la Asp Ser Ala 
110 



Ser 


Glu 


Met 


Phe 


Gly 


Ser 


Asp 


Tyr 


Gly 




10 








15 


Asp 


Tyr 


lie 


Pro 


Thr 


Leu 


Ala 


Ala 




25 










30 


Gly 


Arg 


Lys 


Lys 


Phe 


Arg 


Glu 


Gly 




40 








45 


Val 


Arg 


Arg 


Arg 


Asn 


Ser 


Gly 


Glu 




55 










60 


Pro 


Asn 


Lys 


Lys 


Thr 


Arg 


He 


Ala 




70 








75 


Glu 


Met 


Ala 


Ala 


Arg 


Ala 


His 


Arg 




85 








90 


Gly 


Arg 
100 


Ser 


Ala 


Cys 


Leu 


Asn 
105 



Ala Lys Asp He Gin Lys Ala Ala Ala gIu Ala Ala Leu Ala Phe 

Gin Asp Glu Met Cys Asp Ala Thr Thr Asp His Gly Phe Asp mH 

140 1/1C 



Glu Glu Thr Leu Val Glu Ala He Tyr ^ir Ala Glu Gin Ser GlS 

155 
Met 
170 
Met 
185 
His 

205 - - ^ ^ 210 



■^^^ 160 ^ 

Asn Ala Phe Tyr Met His Asp Glu Ala Met Phe Glu Met Pro Ser 

175 180 
Met Ala Glu Gly Met Leu Leu Pro Leu Pro Ser 
1 ^, 190 195 

Val Gin Trp Asn His Asn His Glu Val Asp Gly Asp Asp Asp Asp 



Leu Leu Ala Asn Met Ala Glu Gly Met Leu Leu Pro Leu Pro Ser 



Val Ser Leu Trp Ser Tyr 
215 



(2) INFORMATION FOR SEQ ID NO : 1 6 : 



(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
{ D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 



( iii ) HYPOTHETICAL : No 
(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tobacco 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 
CD) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 
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(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencinq 

(D) OTHER INFORMATION: Figure 2D 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TTGGCGGCTA CGAATCCC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 210 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Polypeptide 

(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica Napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE; 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

is Pro He Tyr Arg Gly Vai Arg Leu Arg Lys Ser Giy Lys Trp 

al Cys Glu Val Arg Glu Pro Asn Lys Lys Ser Arg He Trp L^u 
20 25 30 

ly Thr Phe Lys Thr Ala Glu Met Ala Ala Arg Ala Hxs Asp Val 
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Ala Ala 
Asp Ser 
Asp lie 
Glu Lys 
Thr Thr 
Glu His 
Ala Glu 
Glu Thr 
Phe Asn 
Glu Glu 
Glu Met 



Leu Ala 
Ala Trp 
Gin Lys 
Ser Asp 
Ala Val 
Gly Met 
Val Asn 
Met Val 
Met Ala 
Pro Ser 
Pro Thr 



35 
Leu 

50 
Arg 

65 
Ala 

80 
Val 

95 
Ala 
110 
Asn 
125 
Asp 
140 
Glu 
155 
Lys 
170 
Lys 
185 
Leu 
200 



Arg Gly 
Leu Arg 
Ala Ala 
Thr Met 
Ser Gin 
Met Glu 
Thr Thr 
Ala Val 
Glu Ser 
Gly Ser 
Leu Ala 



Arg 

He 

Glu 

Gin 

Ala 

Glu 

Thr 

Phe 

Thr 

Tyr 

Asp 



Gly 
Pro 
Ala 
Asn 
Glu 
Ala 
Asp 
Thr 
Val 
Met 
Met 



40 
Ala 
50 
Glu 
70 
Ala 
85 
Gly 
100 
Val 
115 
Thr 
130 
His 
145 
Gly 
160 
Glu 
175 
Asp 
190 
Ala 
205 



Cys Leu 
Thr Thr 
Leu Ala 
Gin Asn 
Asn Asp 
Ala Val 
Gly Val 
Glu Gin 
Ala Ala 
Glu Glu 
Glu Gly 



Asn Tyr 
Cys His 
Phe Glu 
Met Glu 
Thr Thr 
Ala Ser 
Asp Met 
Ser Glu 
Val Val 
Trp Met 
Met Leu 



45 
Ala 

60 
Lys 

75 
Ala 

90 
Glu 
115 
Thr 
120 
Gin 
135 
Glu 
150 
Gly 
165 
Thr 
180 
Leu 
195 
Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 632 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



v'ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE; No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Canola 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE* N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 
(A) NAME/KEY: 
(3) LOCATION: 

(C) IDENTIFICATION METHOD- 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
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(xi) 


SEQUENCE DESCRIPTION: 


CJP'O T n MO • 


1 Q . 

1 o : 




CACCCGATAT 


ACCGGGGAGT 


TCGTCTGAGA 




i bCrCjl GTG 


50 


TGAAGTGAGG 


GAACCAAACA 


AGAAATCTAG 


r\r\ J. i 1 vjijiw i i 


ULaAAC i i 1 CA 


100 


AAACAGCTGA 


GATGGCAGCT 


CGTGCTCACG 






150 


CGTGGAAGAG 


GCGCCTGCCT 


CAATTATGCG 




Try f~* 


200 


CATCCCGGAG 


ACAACCTGCC 


ACAAGGATAT 




bCTGCTGAAG 


250 


CCGCATTGGC 


TTTTGAGGCT 


GAGAAAAGTG 




GCAAAATGGC 


300 


CAGAACATGG 


AGGAGACGAC 


GGCGGTGGCT 




AAGTGAATGA 


350 


CACGACGACA 


GAACATGGCA 


TGAACATGGA 




GCAGTGGCTT 


400 


CTCAGGCTGA 


GGTGAATGAC 


ACGACGACGG 




AGACATGGAG 


450 


GAGACAATGG 


TGGAGGCTGT 


TTTTACTGGG 




AAGGGTTTAA 


500 


CATGGCGAAG 


GAGTCGACGG 


TGGAGGCTGC 


TGTTGTTACG 


GAGGAACCGA 


550 


GCAAAGGATC 


TTACATGGAC 


GAGGAGTGGA 


TGCTCGAGAT 


GCCGACCTTG 


600 


TTGGCTGATA 


TGGCAGAAGG 


GATGCTCCTG 


CC 




632 


(2) INFORiyiATION FOR SEQ ID NO: 19 









(i) SEQUENCE CHARACTERISTICS- 

CA) LENGTH: 36 

(B) TYPE: Nucleic Acid Seque 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: 




(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 


(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 
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CD) OTHER INFORMATION; 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19; 
GCCCAAGCTT CAAGTTTAGT GAGCACTATG TGCTCG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: Nucleic Acid Sequence 
CO STRANDEDNESS: Single 

CD) TOPOLOGY: Linear 

Cii) MOLECULE TYPE: DNA 
Ciii) HYPOTHETICAL: No 
(iv) ANTI-SENSE: No 
Cv) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 



CA) 


ORGANISM: 




CB) 


STRAIN: 




CC) 


INDIVIDUAL ISOLATE: 


N/A 


CD) 


DEVELOPMENTAL STAGE: 


N/A 


CE) 


HAPLOTYPE: N/A 


CF) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




CH) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





36 



Cvii) IMMEDIATE SOURCE: N/A 
Cviii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

CA) NAME/KEY: 

CB) LOCATION; 
(C) IDENTIFICATION METHOD: 
CD) OTHER INFORMATION: 

Cx) PUBLICATION INFORMATION: 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGAAGATCTC CTTCCCAGAA ACAACACAAT CTAC 34 



C2) INFORMATION FOR SEQ ID NO: 21: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 35 

CB) TYPE: Nucleic Acid Sequence 

CC) STRANDEDNESS: Single 

CD) TOPOLOGY: Linear 

Cii) MOLECULE TYPE: DNA 
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(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE' N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD- 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GCCCAAGCTT GTTTCATTTT CTCCATGAAG GAGAT 35 

(2) INFORMATION FOR SEQ ID NO: 22: 

Ci) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 39 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
Cv) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE- N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 
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(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD- 
CD) OTHER INFORMATION: 

(K) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GGA.AGATCTT ATCGTCGTCG TCGTCTACCA AAACCACAC 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 

(B) TYPE: Nucleic Acid Seque 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE- N/A 

CE) HAPLOTYPE: N/A 

CF) TISSUE TYPE: N/A 

CG) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD - 

(D) OTHER INFORMATION: 



(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GCTCTAAGCT TCACAAGGGG TTCGTTTGGT GC 

C2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 4 0 

(B) TYPE: Nucleic Acid Sequence 
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(C) STRANDEDNESS: Sinale 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE* N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD- 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GGGGTACCTT TTGGGAGTTG GAATAGAAAT GGGTTTGATG 

(2) INFORMATION FOR SEQ ID NO: 25: 

Ci) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 36 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single ^ ''''^ 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE- N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 
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(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GCCCAAGCTT AATTTTACTC AAAATGTTTT GGTTGC 35 



(2) INFORMATION FOR SEQ ID NO : 2 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: 
CB) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
CCGGTACCTT TCCAAAGATT TTTTTCTTTC CAATAGAAGT AATC 



(2) INFORMATION FOR SEQ ID NO: 27: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 
ORGANISM: 
STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE; 

(A) NANE/KEY; 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GCGGAAGCTT CATTTTCTGC TACAGAAGTG 30 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 

(B) TYPE: Nucleic Acid Seauence 
CO STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 
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(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 
Cviii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCGGTACCTT TCCAAAGCTG TGTTTTCTCT TTTTCAAGTG 4 0 

(2) INFORMATION FOR SEQ ID NO: 29: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

GCCCAAGCTT CAAATTCTGA ATATTCACAT ATCAAAAAAG TG 4 2 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 40 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

( iii ) HYPOTHETICAL : No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGAAGATCTG TTCTTCTTGT CTTAAGCAAA CACTTTGAGC 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 41 

(B) TYPE: Nucleic Acid Seque 

(C) STRANDEDNESS : Single 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

( iii ) HYPOTHETICAL : No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 
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(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 

GCCCAAGCTT TCGTCTGTTA TCATACAAGG CACAAAACGA C 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 42 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: 




(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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GGAAGATCTA GTTAATCTTG ATTTGATTAA AAGTTTATAT AG 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 25 

{B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CAAACTCAGT AGGATTCTGG TGTGT 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS- 
{A) LENGTH: 38 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 
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(A) 


ORGANISM: 




(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGAAGATCTT GAAACAGAGT ACTCTGATCA ATGAACTC 33 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: 
CB) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CGCGGATCCC TCGTTTCTAC AACAATAAAA TAAAATAAAA TG 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 37 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 
Civ) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 
CH) CELL LINE: N/A 
(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GGGGTACCTG AAACAGAGTA CTCTGATCAA TGAACTC 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 4 1 

(B) TYPE: Nucleic Acid Sequence 
CO) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
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(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE; 

(A) NAME/KEY; 

(B) LOCATION; 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GCTCTAGACT CGTTTCTACA ACAATAAAAT AAAATAAAAT G 41 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 577 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica juncea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

( I ) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bjCBFl gene 

(B) LOCATION: 
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(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

^^^^J""^^ SEQUENCE DESCRIPTION: SEQ ID NO-38- 
TTTCACCCTA TCTACCGGGG AGTTCGCCTG AG^AAGT^AG GTAAGTGGGT 

GTGTGAAGTG AGGGAGCCAA ACAAGAAATC TAGGATTTGG CTTGGAACTT 

TCAAAACCGC AGAGATCGCT GCTCGTGCTC ACGACGTTGC CGCCTTAGCC 

CTCCGTGGAA GAGCGGCCTG TCTCAACTTC GCCGACTCGG CTTGGCGGCT 

CCGTATCCCG GAGACAACTT GCGCCAAGGA TATCCAGAAG GCTGCTGCTG 

AAGCTGCGTT GGCTTTTGGG GCCGAAAAGA GTGATACCAC GACGAATGAT 

CAAGGCATGA ACATGGAGGA GATGACGGTG GTGGCTTCTC AGGCTGAGGT 

GAGCGACACG ACGACATATC ATGGCCTGGA CATGGAGGAG ACTATGGTGG 

AGGCTGTTTT TGCTGAGGAA CAGAGAGAAG GGTTTTACTT GGCGGAGGAG 

ACGACGGTGG AGGGTGTTGT TACGGAGGAA CAGAGCAAAG GGTTTTATAT 

GTACGAGGAG TGGACGTTCG GGATGCAGTC CTTTTTGGCC GATATGGCTG 

AAGGCATGCT CTTTTCAAAG GGCGAAT 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 130 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 
CD) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica juncea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 
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150 

200 

250 

300 

350 

400 

450 

500 

550 

577 
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(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CATCCGATCT ACAGGGGAGT TCGTCTGAGA AAATCAGGTA AGTGGGTGTG 50 

TGAAGTGAGG GAACCAAACA AGAGATCTAG GATCTGGCTC GGTACTTTCC 100 

TAACCGCCGA GATCGCAGCT CGCGCTCACG ACGTCGCCGC CATAGCCCTC 150 

CGTGGCAAAT CCGCATGTCT CAATTTCGCT GACTCGGCTT GGCGGCTCCG 2 00 

TATCTCGGAG ACAACATGCC CTAAGGAGAT TCAGAAGGCT GCTGCTGAAG 2 50 

CCGCGGTGGC TTTTCAGGCT GAGCTA7\ATG ATACGACGGC CGATCATGGC 300 

CTTGACGTGG AGGAGACGAT CGTGGAGGCT ATTTTCACGG AGGAAAGCAG 350 

CGAAGGGTTT TATATGGACG AGGAGTTCAT GTTCGGGATG CCGACCTTGT 4 00 

GGGCTAGTAT GGCAGAAGGG ATGCTTCTTC C 4 31 
(2) INFORMATION FOR SEQ ID NO : 4 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

{ii} MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 



(A) 


ORGAN ISM: Br a s s i ca 


j uncea 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(1) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BJCBF2-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
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(xi) 


SEQUENCE 


DESCRIP 


His 


Piro 


lie 


T* \7 T" 

1 yr 




y 


val 


Val 


y o 




V a X 


0 

Ajtci" 


r' 1 n 

tjlu 


Pro 










z u 






Gly 


Thr 


IT 1 Its 


LiG U 


i n JT 


H.JLa 


CjIU 










J D 






Ala 


r-i_L ci 


X i. e 


H.X a 


Leu 


Arg 


Gly 
















Asp 






i rp 


Arg 


Leu 


Arg 










D D 






Glu 


X — S 


r 1 n 


Lys 


Ai a 


Ala 


Ala 










O 0 






Glu 


Leu 


Asn 


Asp 


Thr 


Thr 


Ala 










95 






Thr 


He 


Val 


Glu 


Ala 


He 


Phe 










110 






Tyr 


Met: 


Asp 


Glu 


Giu 


Phe 


Met 










125 






Ser 


Met 


Ala 


Glu 


Gly 


Met 


Leu 










140 







ION: SEQ ID NO : 4 1 : 



Arg 


Leu 


Arg 

10 


Lys 


Ser 


Gly 


Lys 


Trp 

15 


Asn 


Lys 


Arg 


Ser 


Arg 


He 


Trp 


Leu 






25 








30 


He 


Ala 


Ala 


Arg 


Ala 


His 


Asp 


Val 






40 








45 


Lys 


Ser 


Ala 


Cys 


Leu 


Asn 


Phe 


Ala 


He 




55 










60 


Ser 


Glu 
70 


Thr 


Thr 


Cys 


Pro 


Lys 
75 


Glu 


Ala 


Ala 
85 


Val 


Ala 


Phe 


Gin 


A 1 A 
90 


Asp 


His 


Gly 
100 


Leu 


Asp 


Val 


Glu 


Glu 
105 


Thr 


Glu 


Glu 


Ser 


Ser 


Glu 


Gly 


Phe 






115 








120 


Phe 


Gly 


Met 


Pro 


Thr 


Leu 


Trp 


Ala 






130 








135 


Leu 




145 













(2) INFORMATION FOR SEQ ID NO : 4 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 



(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica juncea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 
{F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: bjCBFB gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequenci 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 
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(xi) 
TAT rr AATTT 


SEQUENCE DESCRIPTION: 
ACCGTGGAGT TCGTCTGAGA 


SEQ ID NO: 
AAATCAGGTA 


42: 

AGTGGGTGTG 


50 




GAGCCAAACA 


AGAAATCTAG 


GATCTGGCCC 


GGTACTTTCC 


100 


TAACCGCCGA 


GATCGCAGCT 


CGCGCTCACG 


ACGTCGCCGC 


CATAGCCCTC 


150 




CCGCATGTCT 


CAATTTCGCT 


GACTCGGCTT 


GGCGGCTCCG 


200 




ACAACATGCC 


CTAAGGAGAT 


TCAGAAGGCT 


GCTGCTGAAG 


250 


CCGCGGTGGC 


TTTTCAGGCT 


GAGCTAAATG 


ATACGACGGC 


CGATCATGGC 


300 


CTTGACGTGG 


AGGAGACGAT 


CGTGGAGGCT 


ATTTTCACGG 


AGGAAAGCAG 


350 


CGAAGGGTTT 


TATATGGACG 


AGGAGTTCAT 


GTTCGGGATG 


CCGACCTTGT 


400 


GGGCTAGTAT 


GGCGGAGGGC 


ATGCTCCTTC 


C 




431 



(2) INFORMATION FOR SEQ ID NO : 4 3 : 



(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 143 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica juncea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 
CE) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BJCBF3-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 3 : 

is Pro He Tyr Arg Gly Val Arg Leu Arg Lys Ser Gly Lys Trp 
5 10 15 
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Val 


Cys 


Glu 


Val 


Ara 


Glu 












20 






Gly 


Thr 


Phe 


Leu 


Thr 


Ala 


Glu 










35 






Ala 


Ala 


He 


Ala 


Leu 


Arg 


Gly 










50 




Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 










65 




Giu 


He 


Gin 


L vs 


Ala 
80 


Ala 


Ala 


Glu 


Leu 


Asn 


Asp 


Thr 


Thr 


Ala 










95 






Thr 


He 


Val 


Glu 


Ala 


He 


Phe 










110 






Tyr 


Met 


Ala 


Glu 


Glu 


Phe 


Met 










125 






Ser 


Val 


Ala 


Glu 


Gly 


Met 


Leu 










140 







Asn 


Lys 


Lys 


ber 


Arg 


He 


Trp 


Pro 














30 


-1- JL C 


r\x a 


AX a 


Arg 


Ala 


His 


Asp 


Val 






4 u 








45 


Liy s 


c: o V- 
o s r 


fi±a 


Cys 


Leu 


Asn 


Phe 


Ala 


He 














60 


tr J- o 


LjX U 


i nr 


Thr 


Cys 


Pro 


Lys 


Glu 




/ u 










75 


Ala 


Ala 


V d X 


M.X a 


r ne 


Gin 


Ala 






85 










90 


Asp 


His 


Gly 


Leu 


Asp 


Val 


Glu 


Glu 






100 










105 


Thr 


Glu 


Glu 


Ser 


Ser 


Glu 


Gly 


Phe 


Phe 




115 








120 


Gly 


Met 


Pro 


Thr 


Leu 


Trp 


Ala 






130 








135 



Leu 

145 



(2) INFORMATION FOR SEQ ID NO : 4 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 



(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica juncea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: bjCBF4 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequenci 

(D) OTHER INFORMATION: 



(X) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO * 4 4 • 
CATCCAATCT ACCGGGGTGT TCGACAGAGA AACTCAGGGA AATGGGTTTG 50 

TGAAGTTAGG GAGCCTAATA AGAAATCTAG GATCTGGTTA GGGACTTTTC 100 
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CGACCGTCGA AATGGCCGCT CGTGCTCACG ACGTCGCCGC TTTAGCCCTT 150 

CGTGGCCGCT CCGCTTGTCT TAATTTCGCC GACTCGGCGT GGTGTCTACG 200 

GATTCCCGAG TCTACTTGTC CTAAAGAGAT TCAGAAAGCT GCGGCCGAAG 2 50 

CTGCAATGGC GTTTCAGAAC GAGACGGCTA CGACTGAGAC GACTATGGTT 300 

GAGGGAGTCA TACCGGCGGA GGAGACGGTG GGGCAGACGC GTGTGGAGAC 350 

AGCAGAGGAG AACGGTGTGT TTTATATGGA CGATCCAAGG TTTCTTGAGA 4 00 

ATATGGCAGA GGGCATGTTC CTACC 4 25 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica juncea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 
{H} CELL LINE 
( I ) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BJCBF4-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFOFIMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 • 

His Pro He Tyr Arg Gly Val Arg Gin Arg Asn Ser Gly Lys Trp 

^ 10 15 

Val Cys Glu Val Arg Glu Pro Asn Lys Lys Ser Arg He Trp Leu 

20 25 30 

Gly Thr Phe Pro Thr Val Glu Met Ala Ala Arg Ala His Asp Val 

35 40 45 

Ala Ala Leu Ala Leu Arg Gly Arg Ser Ala Cys Leu Asn Phe Ala 

^0 55 60 
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Asp Ser Ala Trp Cys Leu Arg He Pro Glu Ser Thr Cys Pro Lys 

65 70 75 

Glu He Gin Lys Ala Ala Ala Glu Ala Ala Met Ala Phe Gin Asn 

80 85 90 

Glu Glu Thr Ala Thr Thr Glu Thr Thr Met Val Glu Gly Val He 

95 100 105 

Pro Ala Glu Glu Thr Val Gly Gin Thr Arg Val Glu Thr Ala Glu 

110 115 120 

Glu Asn Gly Val Glu Tyr Met Asp Asp Pro Arg Phe Leu Glu Asn 

125 130 135 

Met Ala Glu Gly Met Leu Phe 

140 145 

(2) INFORMJ^TION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE; No 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE; 



(A) 


ORGANISM: Brassica 


napus 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE; 


N/A 


{E) 


HAPLOTYPE; N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




{!) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBFl gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD; sequencing 

(D) OTHER INFORMATION: 

{X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 4 6 : 
CACCCGATAT ACCGGGGAGT TCGTCTGAGA AAGTCAGGTA AGTGGGTGTG 50 

TGAAGTGAGG GAACCAAACA AGAAATCTAG AATTTGGCTT GGAACTTTCA 100 

AAACAGCTGA GATGGCAGCT CGTGCTCACG ACGTCGCTGC CCTAGCCCTC 15 0 

CGTGGAAGAG GCGCCTGCCT CAATTATGCG GACTCGGCTT GGCGGCTCCG 200 

CATCCCGGAG ACAACCTGCC ACAAGGATAT CCAGAAGGCT GCTGCTGAAG 2 50 
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CCGCATTGGC 


i. i A. X \JX^\J\J\^ 1 




AT G T GAG GAT 


GCAAAATGGC 


300 


CAGAACATGG 


AGGAGAPPAP 




TCTCAGGCTG 


AAGTGAATGA 


350 


CACGACGACA 


GAACATGGCA 




bCjAGGCAACG 


GCAGTGGCTT 


400 


CTCAGGCTGA 


GGTGAATGAP 


APPZXPP Arrr* 


Ai LATGGCGT 


AGACATGGAG 


450 






T' T* T' T" A T* ^ 

ill i AC i uGG 


GAACAAAGTG 


AAGGGTTTAA 


500 


CATGGCGAAG 


GAGTCGACGG 


TGGAGGCTGC 


TGTTGTTACG 


GAGGAACCGA 


550 


GCAAAGGATC 


TTACATGGAC 


GAGGAGTGGA 


TGCTCGAGAT 


GCCGACCTTG 


600 


TTGGCTGATA 


TGGCAGAAGG 


GATGCTCCTG 


CO 




632 


(2) INFORMATION FOR SEQ ID NO: 47 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE; 

(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Brassica 


napus 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BNCBFl-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
His Pro He Tyr Arg Gly Val Arg Leu Arg Lys Ser Gly Lys Trp 

5 10 15 

Val Cys Glu Val Arg Giu Pro Asn Lys Lys Ser Arg He Trp Leu 
20 25 30 

Gly Thr Phe Lys Thr Ala Glu Met Ala Ala Arg Ala His Asp Val 
33 4 0 ,45 
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Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 


Leu 


Asn 


Tyr 


Ala 










55 










55 








60 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


lie 


Pro 


Glu 


Thr 


Thr 


Cys 


His 


Lys 










65 










70 








75 


Asp 


He 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Aia 


Ala 


Leu 


Aia 


Phe 


Glu 


Ala 


Glu 








80 










85 










90 


Lys 


Ser 


Asp 


Val 
95 


Thr 


Met 


Gin 


Asn 


Gly 
100 


Gin 


Asn 


Met 


Glu 


Glu 
105 


Thr 


Thr 


Ala 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 










110 










115 








120 


Glu 


His 


Gly 


Met 


Asn 
125 


Met 


Glu 


Glu 


Aia 


Thr 
130 


Ala 


Val 


Aia 


Ser 


Gin 
135 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 


Val 


Asp 


Met 


Glu 










140 










145 








150 


Glu 


Thr 


Met 


Val 


Glu 
155 


Ala 


Val 


Phe 


Thr 


Gly 
160 


Glu 


Gin 


Ser 


Glu 


Gly 
165 


Phe 


Asn 


Met 


Ala 


Lys 
170 


Glu 


Ser 


Thr 


Val 


Glu 
175 


Ala 


Aia 


Val 


Val 


Thr 
180 


Glu 


Glu 


Pro 


Ser 


Lys 


Gly 


Ser 


Tyr 


Met 


Asp 


Glu 


Glu 


Trp 


Met 


Leu 










185 










190 








195 


Glu 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Glu 


Gly 


Met 


Leu 


Leu 










200 










205 








210 



(2) INFORMATION FOR SEQ ID NO : 4 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBF2 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 
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(xi) 
ACCGCTCGAG 


SEQUENCE DESCRIPTION: 
CAACAATGAA CACATTCCCT 


SEQ ID NO: 
GCTTCCACTG 


48 : 

AAATGGTTGG 


50 


CTCCGAGAAC 


GAGTCTCCGG 


TTACTACGGT 


AGTAGGAGGT 


GATTATTATC 


100 


CCATGTTGGC 


GGCAAGCTGT 


CCGAAGAAGC 


CAGCGGGTAG 


GAAGAAGTTT 


150 


CAGGAGAPAP 


GTCACCCCAT 


TTACCGAGGA 


GTTCGTCTGA 


GAAAGTCAGG 


200 


TAAGTGGGTP 


TGTGAAGTGA 


GGGAACCAAA 


CAAGAAATCT 


AGAATTTGGC 


250 




CAAAACAGCT 


GAGATGGCAG 


CTCGTGCTCA 


CGACGTCGCT 


300 


^ ^ ^ V-f ± rtO \^ iw. ^ 


TCCGTGGAAG 


AGGCGCCTGC 


CTCAATTATG 


CGGACTCGGC 


350 


TTGGPGGPTP 


CGCATCCCGG 


AAACAACCTG 


CCACAAGGAT 


ATCCAGAAGG 


400 


CTGCTGPTGA 


AGCCGCATTG 


GCTTTTGAGG 


CTGAGAAAAG 


TGATGTGACG 


450 


AT G C AAAAT G 


GCCTGAACAT 


GGAGGAGACG 


ACGGCGGTGG 


CTTCTCAGGC 


500 


TGAAGTGAAT 


GACACGACGA 


CAGAACATGG 


CATGAACATG 


GAGGAGGCAA 


550 


CAGCGGTGGC 


TTCTCAGGCT 


GAGGTGAATG 


ACACGACGAC 


AGATCATGGC 


600 


GTAGACATGG 


AGGAGACGAT 


GGTGGAGGCT 


GTTTTTACGG 


AGGAACAAAG 


650 


TGAAGGGTTP 


AACATGGCGG 


AGGAGTCGAC 


GGTGGAGGCT 


GCTGTTGTTA 


700 




GAGCAAAGGA 


TTTTACATGG 


ACGAGGAGTG 


GACGTACGAG 


750 


ATGCCGACCT 


TGTTGGCTGA 


TATGGCGGCA 


GGGATGCTTT 


TGCCGCCACC 


800 


ATCTGTACAA 


TGGGGACATA 


ATGATGACTT 


GGAAGGAGAT 


GCGGACATGA 


850 


ACCTCTGGAG 


TTATTAAGGA 


TCCGCG 






876 


(2) INFORMATION FOR SEQ ID NO : 4 9 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



Cii) MOLECULE TYPE: Protein 

( iii ) HYPOTHETICAL : No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica nanus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 
{G) CELL TYPE: N/A 
(H) CELL LINE: N/A 
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(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BNCBF2-PEP 

(B) LOCATION: 

{C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


49: 








Met 


Asn 


Thr 


Phe 


Pro 


Ala 


Ser 


Thr 


Glu 


Met 


Val 


Gly 


Ser 


Glu 


Asn 


Glu 








5 










10 








15 


Ser 


Pro 


Val 


Thr 


Thr 


Val 


Val 


Gly 


Gly 


Asp 


Tyr 


Tyr 


Pro 


Met 










20 










25 








30 


Leu 


Ala 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 


Gin 


Glu 






35 










40 






45 


Thr 


Arg 


His 


Pro 


He 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 




Gly 






50 










55 








60 


Ser 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 










65 










70 






75 


Arg 


lie 


Trp 


Pro 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


Met 


Ala 


Ala 


Arg 










80 










85 










90 


Ala 


His 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 










95 










100 








105 


Leu 


Asn 


Tyr 


Ala 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 










110 










115 








120 


Thr 


Cys 


His 


Lys 


Asp 


lie 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 










125 










130 










135 


Ala 


Phe 


Glu 


Ala 


Glu 


Lys 


Ser 


Asp 


Val 


Thr 


Met 


Gin 


Asn 


Gly 


Leu 










140 










145 








150 


Asn 


Met 


Glu 


Glu 


Thr 


Thr 


Ala 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 










155 










160 










165 


Asp 


Thr 


Thr 


Thr 


Glu 


His 


Gly 


Met 


Asn 


Met 


Glu 


Glu 


Ala 


Thr 


Ala 










170 










175 










180 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 










185 










190 








195 


Val 


Asp 


Met 


Glu 


Glu 


Thr 


Met 


Val 


Glu 


Ala 


Val 


Phe 


Thr 


Glu 


Glu 










200 










205 










210 


Gin 


Ser 


Glu 


Gly 


Phe 


Asn 


Met 


Ala 


Glu 


Glu 


Ser 


Thr 


Val 


Glu 


Ala 










215 










220 










225 


Ala 


Val 


Val 


Thr 


Asp 


Glu 


Leu 


Ser 


Lys 


Gly 


Phe 


Tyr 


Met 


Asp 


Glu 










230 










235 






240 


Glu 


Trp 


Thr 


Tyr 


Glu 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Ala 










245 










250 








255 


Gly 


Met 


Leu 


Leu 


Pro 


Pro 


Pro 


Ser 


Val 


Gin 


Trp 


Gly 


His 


Asn 


Asp 










260 










265 










270 


Asp 


Leu 


Glu 


Gly 


Asp 


Ala 


Asp 


Met 


Asn 


Leu 


Trp 


Ser 


Tyr 














275 










280 








285 



(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 
{1} ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBF3 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(X) PUBLICATION INFORMATION: 



(xi) 
ACTACACTCA 


SEQUENCE DESCRIPTION: 
GCCTTATCCA GTTTTTTTCA 


SEQ ID NO: 
AAAGATTTTT 


50: 
CAACAATGAA 


50 


CACATTCCCT 


GCGTCCACTG 


AAATGGTTGG 


CTCCGAGAAC 


GAGTCTCCGG 


100 


TTACTACGGT 


AGCAGGAGGT 


GATTATTATC 


CCATGTTGGC 


GGCAAGCTGT 


150 


CCGAAGAAGC 


CAGCAGGTAG 


GAAGAAGTTT 


CAGGAGACAC 


GTCACCCCAT 


200 


TTACCGAGGA 


GTTCGTCTGA 


GAAAGTCAGG 


TAAGTGGGTG 


TGTGAAGTGA 


250 


GGGAACCAAA 


CAAGAAATCT 


AGAATTTGGC 


CCGGAACTTT 


CAAAACAGCT 


300 


GAGATGGCAG 


CTCGTGCTCA 


CGACGTCGCT 


GCCCTAGCCC 


TCCGTGGAAG 


350 


AGGCGCCTGC 


CTCAATTATG 


CGGACTCGGC 


TTGGCGGCTC 


CGCATCCCGG 


400 


AGACAACCTG 


CCACAAGGAT 


ATCCAGAAGG 


CTGCTGCTGA 


AGCCGCATTG 


450 


GCTTTTGAGG 


CTGAGAAAAG 


TGATGTGACG 


ATGCAAAATG 


GCCTGAACAT 


500 


GGAGGAGACG 


ACGGCGGTGG 


CTTCTCAGGC 


TGAAGTGAAT 


GACACGACGA 


550 


CAGAACATGG 


CATGAACATG 


GAGGAGGCAA 


CGGCAGTGGC 


TTCTCAGGCT 


600 


GAGGTGAATG 


ACACGACGAC 


GGATCATGGC 


GTAGACATGG 


AGGAGACAAT 


650 


GGTGGAGGCT 


GTTTTTACTG 


GGGAACAAAG 


TGAAGGGTTT 


AACATGGCGA 


700 


AGGAGTCGAC 


GGTGGAGGCT 


GCTGTTGTTA 


CGGAGGAACC 


GAGCAAAGGA 


750 
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TCTTACATGG ACGAGGAGTG GATGCTCGAG ATGCCGACCT TGTTGGCTGA 8 00 

TATGGCGGAA GGGATGCTTT TGCCGCCGCC GTCCGTACAA TGGGGACAGA 8 50 

ATGATGACTT CGAAGGAGAT GCTGACATGA ACCT 334 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BNCBF3-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


51 : 








Met 


Asn 


Thr 


Phe 


Pro 


Ala 


Ser 


Thr 


Glu 


Met 


Val 


Giy 


Ser 


Glu 


Asn 


Giu 








5 










10 










15 


Ser 


Pro 


Val 


Thr 


Thr 


Val 


Ala 


Giy 


Gly 


Asp 


Tyr 


Tyr 


Pro 


Met 


Leu 


Ala 






20 










25 








30 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Giy 


Arg 


Lys 


Lys 


Phe 


Gin 


Glu 






35 










40 






45 


Thr 


Arg 


His 


Pro 


He 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 


Ser 


Gly 






50 










55 








60 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 










65 










70 






75 


Arg 


He 


Trp 


Pro 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


Met 


Ala 


Ala 


Arg 


Ala 








80 










85 










90 


His 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 


Leu 








95 










100 










105 


Asn 


Tyr 


Ala 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 
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110 










115 










120 


Thr 


Cys 


His 


Lys 


Asp 
125 


He 


Gin 


Lys 


Ala 


Ala 
130 


Ala 


Giu 


Ala 


Ala 


Leu 
135 


Ala 


Phe 


Giu 


Ala 


Giu 


Lys 


Ser 


Asp 


Val 


Thr 


Met 


Gin 


Asn 


Gly 


Leu 










14 0 










145 








150 


Asn 


Met 


Giu 


Giu 


Thr 
155 


Thr 


Ala 


Val 


Ala 


Ser 
160 


Gin 


Ala 


Giu 


Val 


Asn 
165 


Asp 


Thr 


Thr 


Thr 


Giu 
170 


His 


Gly 


Met 


Asn 


Met 
175 


Giu 


Giu 


Ala 


Thr 


Ala 
180 


Val 


Ala 


Ser 


Gin 


Ala 


Giu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 










185 










190 








195 


Val 


Asp 


Met 


Giu 


Giu 


Thr 


Met 


Val 


Giu 


Ala 


Val 


Phe 


Thr 


Gly 


Giu 










200 










205 








210 


Gin 


Ser 


Giu 


Gly 


Phe 
215 


Asn 


Met 


Ala 


Lys 


Giu 
220 


Ser 


Thr 


Val 


Giu 


Ala 
225 


Ala 


Val 


Val 


Thr 


Giu 


Giu 


Pro 


Ser 


Lys 


Gly 


Ser 


Tyr 


Met 


Asp 


Giu 










230 










235 








240 


Giu 


Trp 


Met 


Leu 


Giu 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Giu 


Gly 








245 










250 








255 


Met 


Leu 


Leu 


Pro 


Pro 


Pro 


Ser 


Val 


Gin 


Trp 


Gly 


Gin 


Asn 


Asp 










260 










265 








270 


Asp 


Phe 


Giu 


Gly 


Asp 
275 


Ala 


Asp 


Met 


Asn 


280 













(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 874 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE; N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBF4 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 
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(xi) 
GTAATTCGAT 


SEQUENCE DESCRIPTION: 
TACCGCTCGA GTACTTACTA 


SEQ ID NO: 

i He i AL. At^ i 


52 : 

ACjUU i TATLL 


50 


AGTTTTTCAA 


AAGAAGTTTT 


CAACTATGAA 




7\ T' T* Ti rp 


100 


AACTTCTTGG 


CTCTGAGAAC 


GAGTCTCCGG 


J. i (jKj I 


i i AL. i (ji UCC 


150 


ATGTTGGCGG 


CGAGCTGTCC 


GAAGAAGCCG 




AtjAAGTTTCG 


200 


GGAGACACGT 


CACCCCATTT 


ACCGAGGAGT 




AAA i CAGGTA 


250 


AGTGGGTGTG 


TGAAGTGAGG 


GAACCAAACA 




bA rTTGGCTC 


300 


GGAACTTTCA 


AAACAGCTGA 


GATCGCAGCT 




A ^ ^ T* r"* r"* r"^ 
ACb i LbCCbC 


350 


CTTAGCTCTC 


CGTGGAAGAG 


GCGCCTGCCT 




bAb X bbbCTT 


4 00 


GGCGGCTCCG 


TATCCCGGAG 


ACAACCTGCG 




bb AbAAbbb I 


4 50 


GCTGCTGAAG 


CCGCATTGGC 


TTTTGAGGCC 




A i A*bbAbbAb 


500 


GAATGATCAT 


GGCATGAACA 


TGGCTTCTCA 




AA i bAb Ab AA 


550 


CGGATCATGG 


CCTGGACATG 


GAGGAGACGA 




rn /"» rn rp rp rp rp tv rn 

i b i 1 i i 1 AbT 


600 


GAGGAGCAGA 


GAGACGGGTT 


TTACATGGCG 


o-rio o M. M.^^ o /\ 


bbb i bbAbbb 


650 


TGTTGTTCCG 


GAGGAACAGA 


TGAGCAAAGG 


i i 1 o 


bAboAb^jAbT 


700 


GGATGTTCGG 


GATGCCGACC 


TTGTTGGCTG 


t\ i A J. Cj(jCCj(jL. 


AGGGATGCTC 


750 


TTACCGCCGC 


CGTCCGTACA 


ATGGGGACAT 


AATGATGACT 


TCGAAGGAGA 


800 


TGTTGACATG 


AACCTCTGGA 


ATTATTAGTA 


CTCATATTTT 


TTTAAATTAT 


850 


TTTTTGAACG 


AATAATATTT 


TATT 






874 


(2) INFORMATION FOR SEQ ID NO: 53 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 

(B) TYPE: Amino Acid 

(C) 3TRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Bras si ca 


napus 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


: N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 
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(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BNCBF4-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



Met 


Asn 


Ser 


Val 


Ser 


Thr 


Phe 


Ser 


Glu 


Leu 


Leu 


Gly 


Ser 


Glu 


Asn 










5 










10 








15 


Glu 


Ser 


Pro 


Val 


Gly 


Gly 


Asp 


Tyr 


Cys 


Pro 


Met 


Leu 


Ala 


Ala 


Ser 


Cys 


Pro 






20 










25 










30 


Lys 


Lys 


Pro 
35 


Ala 


Gly 


Arg 


Lys 


Lys 
40 


Phe 


Arg 


Glu 


Thr 


Arg 
45 


His 


Pro 


He 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 


Ser 


Gly 


Lys 


Trp 


Vai 








50 










55 






60 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 


Arg 


He 


Trp 


Leu 










65 










70 






75 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


He 


Ala 


Ala 


Arg 


Ala 


His 


Asp 


Val 


Ala 


Ala 






80 










85 








90 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 


Leu 


Asn 


Phe 


Ala 










95 










100 








105 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 


Thr 


Cys 


Ala 


Lys 










110 










115 








120 


Asp 


He 


Gin 


Lys 


Ala 
125 


Ala 


Ala 


Glu 


Ala 


Ala 
130 


Leu 


Ala 


Phe 


Glu 


Ala 
135 


Glu 


Lys 


Ser 


Asp 


Thr 
140 


Thr 


Thr 


Asn 


Asp 


His 
145 


Gly 


Met 


Asn 


Met 


Ala 
150 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Asp 


His 


Gly 


Leu 


Asp 


Met 










155 










160 






165 


Glu 


Glu 


Thr 


Met 


Val 


Glu 


Ala 


Val 


Phe 


Thr 


Glu 


Glu 


Gin 


Arg 


Asp 










170 










175 








180 


Gly 


Phe 


Tyr 


Met 


Ala 


Glu 


Glu 


Thr 


Thr 


Val 


Glu 


Gly 


Val 


Val 


Pro 










185 










190 








195 


Glu 


Glu 


Gin 


Met 


Ser 


Lys 


Gly 


Phe 


Tyr 


Met 


Asp 


Glu 


Glu 


Trp 


Met 










200 










205 








210 


Phe 


Gly 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Ala 


Gly 


Met 


Leu 










215 










220 








225 


Leu 


Pro 


Pro 


Pro 


Ser 


Val 


Gin 


Trp 


Gly 


His 


Asn 


Asp 


Asp 


Phe 


Glu 










230 










235 








240 


Gly 


Asp 


Val 


Asp 


Met 
245 


Asn 


Leu 


Trp 


Asn 


Tyr 
250 











(2) INFORMATION FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 898 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
CD) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 
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(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE; N/A 

(H) CELL LINE: N/A 
{I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBF5 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AATAAATATC TTATCAAACC AGTCAGAACA GAGATCTTGT TACTTACTAT 

ACTACACTCA GCCTTATCCA GTTTTCAAAA AAAGTATTCA ACGATGAACT 



CAGTCTCTAC TTTTTCTGAA CTGCTCCGCT CCGAGAACGA GTCTCCGGTT 150 

AATACGGAAG GTGGTGATTA CATTTTGGCG GCGAGCTGTC CCAAGAAACC 2 00 

TGCTGGTAGG AAGAAGTTTC AGGAGACACG CCACCCCATT TACAGAGGAG 250 

TTCGTCTGAG GAAGTCAGGT AAGTGGGTGT GTGAAGTGAG GGAACCAAAC 3 00 

AAGAAATCTA GAATTTGGCT CGGAACTTTC AAAACAGCTG AGATCGCAGC 35 0 

TCGTGCTCAC GACGTTGCCG CCTTAGCTCT CCGTGGAAGA GGCGCCTGCC 4 00 

TCAACTTCGC CGACTCGGCT TGGCGGCTCC GTATCCCGGA GACGACCTGC 4 50 

GCCAAGGATA TCCAGAAGGC TGCTGCTGAA GCCGCATTGG CTTTTGAGGC 5 00 

CGAGAAGAGT GATACCACGA CGAATGATCA TGGCATGAAC ATGGCTTCTC 55 0 

AGGTTGAGGT TAATGACACG ACGGATCATG ACCTGGACAT GGAGGAGACG 600 

ATAGTGGAGG CTGTTTTTAG GGAGGAACAG AGAGAAGGGT TTTACATGGC 650 

GGAGGAGACG ACGGTTGTGG GTGTTGTTCC GGAGGAACAG ATGAGCAAAG 7 00 

GGTTTTACAT GGACGAGGAG TGGATGTTCG GGATGCCGAC CTTGTTGGCT 7 50 

GATATGGCGG CAGGGATGCT CTTACCGCTG CCGTCCGTAC AATGGGGACA 8 00 

TAATGATGAC TTCGAAGGAG ATGCTGACAT GAACCTCTGG AATTATTAGT 8 50 
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ACTCATATTT TTTTAAATTA TTTTTTGAAC GAATAATATT TTATTGAA 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 251 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

Cii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vx) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 
CD) DEVELOPMENTAL STAGE* N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 
CG) CELL TYPE: N/A 
(H) CELL LINE: N/A 
(i) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BNCBF5-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: seauenci 

(D) OTHER INFORMATION: 



(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 



5 

Asn 

20 
Lys 

35 
Tyr 

50 
Val 

65 
Lys 

80 
Ala 

95 
Trp 
110 
Lys 
125 



Thr 


Phe 


Ser 


Glu 


Leu 


Leu 


Arg 


Ser 


Glu 


Asn 


Thr 








10 










15 


Glu 


Gly 


Gly 


Asp 


Tyr 


He 


Leu 


Ala 


Ala 


Pro 


Ala 






25 










30 


Gly 


Arg 


Lys 


Lys 


Phe 


Gin 


Glu 


Thr 


Arg 








40 










45 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 


Ser 


Gly 


Lys 


Arg 








55 










60 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 


Arg 


He 


Trp 


Thr 








70 










75 


Ala 


Glu 


He 


Ala 


Ala 


Arg 


Ala 


His 


Asp 


Leu 








85 










90 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 


Leu 


Asn 


Phe 


Arg 








100 










105 


Leu 


Arg 


He 


Pro 


Glu 


Thr 


Thr 


Cys 


Ala 


Ala 








115 








120 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 


Ala 


Phe 


Glu 


Thr 


Thr 






130 










135 


Thr 


Asn 


Asp 


His 


Gly 


Met 


Asn 


Met 
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^ 145 150 

Val Asn Asn Thr 
155 165 

Val GIU Ala V;:* 1 Phici Ziv-/-T /^l,, r-T /-T- 

170 1-7^ 

Ala GiU Glu Thr Th-r 1 \7=.l ir^ i 

xgo 

Ser Lvs Glv PHr Tx/t M<af- m ^t., 

200 210 



(2) 



Val 


Asn 


Asp 


Thr 


Thr 


Asp 


His 


Asp 


Leu 










160 








Val 


Glu 


Ala 


Val 


Phe 
175 


Arg 


Glu 


Glu 


Gin 


Ala 


Glu 


Glu 


Thr 


Thr 


Val 


Val 


Gly 


Val 










190 








Ser 


Lys 


Gly 


Phe 


Tyr 


Met 


Asp 


Glu 


Glu 










205 








Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Ala 


Gly 










220 








Ser 


Val 


Gin 


Trp 


Gly 
235 
Asn 


His 


Asn 


Asp 


Asp 


Met 


Asn 


Leu 


Trp 


Tyr 
















250 








. SEQ 


ID 


NO : 5 6 : 













215 ^on 225 

ber Val Gin Trr^ m ^/ ui^ 7x„« n 

230 235 240 

^ Met Asn Leu Trn A en Twv- 

255 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1132 

{B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Brassica 


napus 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


CD) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 


(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBF6 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencin 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO-56' 
GATTACCGCT CGAGTACTTA CTATACTACA CTCAGCCTTA TCCAGTTTTT 

CTCAAAAGAT TTTTCAACAA TGAACACATT CCCTGCTTCC ACTGAAATGG 

TTGGCTCCGA GAACGAGTCT CCGGTTACTA CGGTAGTAGG AGGTGATTAT 
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TATCCCATGT 


TGGCGGCAAG 




A/ilj U U o 


1 AGCjAAG AA 


200 


GTTTCAGGAG 


ACACGTCACC 


CCATTTAprr; 






250 


CAGGTAAGTG 


GGTGTGTGAA 


GTGAGGGAAC 


PAAAPAAPnn 




300 


TGGCTTGGAA 


CTTTCAAAAC 


AGCTGAGATC^ 






350 


GGCTGCCCTA 


GCCCTCCGTG 


GAAGAGr;pr;r 




i ArGCGGACT 


4 00 


CGGCTTCGCG 


GCTCCGCATC 


CCGC^A^APA A 

^ V-' v3 O iT. jtVjTi 




GGATATCCAG 


4 50 


AAGGCTGCTG 


CTGAAGCCGC 


ATTCIC^rTTTT 

^ X X\JVJV^± J. i 1 




A 7\ T\ m T\ m m 

AAAGTGATGT 


500 


GACGATGGAG 


GAGACGATGG 


CGGTGGrTTr 




L i LAATGACA 


550 


CGACGACAGA 


TCATGGCATG 


AACATGGAnr; 


APPPAAr'ArT' 




600 


CAGGCTGAGG 


TGAATGACAC 


GACGACAf^AT 


PATPPr^nTzm 
V-^rt J. oo^^^o 


ALATGGAGGA 


650 


GACGATGGTG 


GAGGCTGTTT 


TT APrif^ Ann A 


IS. IMS. 7\r'T'OA7\ 


GGGTTCAACA 


700 


TGGCGGAGGA 


GTCGACGGTG 




rprp/TprpTv r^r^f^7\ 


TGAACTGAGC 


750 


AAAGGATTTT 


ACATGGACGA 


GGAGTnnAm 


ii-i,*wL3i-iL3A.i tjL- 


LLiACCTTGTT 


800 


GGCTGATATG 


GCGGCAGGGA 


TGCTTTTGCC 


PPPAPP ATPT 


Li i AL AA i LtLjG 


850 


GACATAATGA 


TGACTTGGAA 


GGAGATGCTf; 


APATPZi Zirr'T 


r^rp /— > r-^ TV 7\rprp7\rp 

L i LLtAA i 1 Al 


900 


TAATACTCGT 


GTTTTAAAAA 


TTATACATTG 


TPPAATAATA 


rprprprp-rvryir-ir^Tv -tl 

i i i i Al LLiAA 


950 


TTTCTAATTC 


TGCCTTTAAC 


TTTT A ATrxnr: 

-L X i X rtTT. X LjVJO 


iUil iAil 


A rp rp TV TV T\ 

AGTGTAGGAA 


1000 


ACGAGTGTAA 


ATGTTCCGCC 


GTGGTGTTGT 


CAAAATGCTG 


ATTATTTTTT 


1050 


GTGTGCAGCA 


TAATCACGTT 


TGGTTTCCTT 


TACACTCCAA 


ATTTAGTTGA 


1100 


AATACAAATA 


GAATAGAAAA 


GTGAAAAAAT 


GT 




1132 


(2) INFOEIMATION FOR SEQ ID NO: 57 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 
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(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BNCBF6-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



Met 


Asn 


Thr 


Phe 


Pro 


Ala 


Ser 


Thr 


Giu 


Met 


Val 


Gly 


Ser 


Glu 


Asn 










5 










10 








15 


Giu 


Ser 


Pro 


Val 


Thr 


Thr 


Val 


Val 


Gly 


Gly 


Asp 


Tyr 


Tyr 


Pro 


Met 










20 










25 








30 


Leu 


Ala 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 










35 










40 








45 


Gin 


Glu 


Thr 


Arg 


His 


Pro 


He 


Tyr 


Arg 


Gly 


Vai 


Arg 


Leu 


Arg 


Lys 










50 










55 










60 


Ser 


Gly 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Giu 


Pro 


Asn 


Lys 


Lys 


Ser 










65 










70 






75 


Arg 


lie 


Trp 


Leu 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Giu 


Met 


Ala 


Ala 


Arg 










80 










85 










90 


Ala 


His 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Giy 


Ala 


Cys 










95 










100 










105 


Leu 


Asn 


Tyr 


Ala 


Asp 


Ser 


Ala 


Ser 


Arg 


Leu 


Arg 


He 


Pro 


Giu 


Thr 










110 










115 








120 


Thr 


Cys 


His 


Lys 


Asp 


He 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 










125 










130 










135 


Ala 


Phe 


Glu 


Ala 


Glu 


Lys 


Ser 


Asp 


Val 


Thr 


Met 


Glu 


Giu 


Thr 


Met 










140 










145 










150 


Ala 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 










155 










160 








165 


Gly 


Met 


Asn 


Met 


Glu 


Glu 


Ala 


Thr 


Ala 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 










170 










175 










180 


Vai 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 


Val 


Asp 


Met 


Giu 


Giu 


Thr 










185 










190 








195 


Met 


Val 


Glu 


Ala 


Val 


Phe 


Thr 


Glu 


Giu 


Gin 


Ser 


Glu 


Gly 


Phe 


Asn 










200 










205 








210 


Met 


Ala 


Glu 


Glu 


Ser 


Thr 


Val 


Glu 


Ala 


Ala 


Vai 


Vai 


Thr 


Asp 


Glu 










215 










220 








225 


Leu 


Ser 


Lys 


Gly 


Phe 


Tyr 


Met 


Asp 


Glu 


Glu 


Trp 


Thr 


Tyr 


Glu 


Met 










230 










235 








240 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Ala 


Giv 


Met 


Leu 


Leu 


Pro 


Pro 










245 










250 










255 


Pro 


Ser 


Val 


Gin 


Trp 


Gly 


His 


Asn 


Asp 


Asp 


Leu 


Glu 


Giy Asp 


Ala 










260 










265 










270 


Asp 


Met 


Asn 


Leu 


Trp 


Asn 


Tyr 



















275 280 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 68 

(B) TYPE: Nucleic Acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 
(D TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBF7 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 



(xi) 
AGTGATGTTT 


SEQUENCE DESCRIPTION: 
TTCAAAAGAA GTTTTCAACT 


SEQ ID NO: 
ATGAACTCAG 


58 : 

TCTCTACTTT 


50 


TTCTGAACTT 


CTTGGCTCTG 


AGAACGAGTC 


TCCGGTAGGT 


GGTGATTACT 


100 


GTCCCATGTT 


GGCGGCGAGC 


TGTCCGAAGA 


AGCCGGCGGG 


TAGGAAGAAG 


150 


TTTCGGGAGA 


CACGTCACCC 


CATTTACCGA 


GGAGTTCGCC 


TTAGAAAATC 


250 


GGCTCGGTAC 


TTTCCTAACA 


GCCGAGATCG 


CAGCCCGTGC 


TCACGACGTC 


300 


GCCGCCATAG 


CCCTCCGCGG 


CAAATCAGCT 


TGTCTCAATT 


TTGCCGACTC 


350 


CGCTTGGCGG 


CTCCGTATCC 


CGGAGACAAC 


ATGCCCCAAG 


GAGATTCAGA 


400 


AGGCGGCTGC 


TGAAGCCGCG 


GTGGCTTTTA 


AGGCTGAGAT 


AAATAATACG 


450 


ACGGCGGATC 


ATGGCATTGA 


CGTGGAGGAG 


ACGATCGTTG 


AGGCTATTTT 


500 


CACGGAGGAA 


AACAACGATG 


GTTTTTATAT 


GGACGAGGAG 


GAGTCCATGT 


550 


TCGGGATGCC 


GGCCTTGTTG 


GCTAGTATGG 


CTGAAGGAAT 


GCTJTTGCCG 


600 


CCTCCGTCCG 


TACAATTCGG 


ACATACCTAT 


GACTTTGACG 


GAGATGCTGA 


650 


CGTGTCCCTT 


TGGAGTTATT 


AGTACAAAGA 


TTTTTTATTT 


CCATTTTTGG 


700 


TATAATACTT 


CTTTTTGATT 


TTCGGATTCT 


ACCTTTTTAT 


GGGTATCATT 


750 
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TTTTTTTTAG GAAACGGG 

(2) INFORMATION FOR SEQ ID NO : 5 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 



(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Brassica 


napus 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(1) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: 3NCBF7-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



Cx) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


Met 


Asn 


Ser 


Val 


Ser 










5 


Glu 


Ser 


Pro 


Val 


Gly 










20 


Cys 


Pro 


Lys 


Lys 


Pro 










35 


His 


Pro 


He 


Tyr 


Arg 










50 


Val 


Cys 


Glu 


Val 


Arg 










65 


Gly 


Thr 


Phe 


Leu 


Thr 










80 


Ala 


Ala 


He 


Ala 


Leu 










95 


Asp 


Ser 


Ala 


Trp 


Arg 










110 


Glu 


lie 


Gin 


Lys 


Ala 










125 


Glu 


He 


Asn 


Asn 


Thr 



DESCRIPTION: SEQ ID 



Thr 


Phe 


Ser 


Glu 


Leu 










10 


Gly 


Asp 


Tyr 


Cys 


Pro 










25 


Ala 


Gly 


Arg 


Lys 


Lys 










40 


Gly 


Val 


Arg 


Leu 


Arg 










55 


Glu 


Pro 


Asn 


Lys 


Lys 










70 


Ala 


Glu 


He 


Ala 


Ala 










85 


Arg 


Gly 


Lys 


Ser 


Ala 










100 


Leu 


Arg 


He 


Pro 


Glu 










115 


Ala 


Ala 


Glu 


Ala 


Ala 










130 


Thr 


Ala 


Asp 


His 


Gly 



NO: 59: 



Leu 


Gly 


Ser 


Glu 


Asn 










15 


Met 


Leu 


Ala 


Ala 


Ser 










30 


Phe 


Arg 


Glu 


Thr 


Arg 










45 


Lys 


Ser 


Gly 


Lys 


Trp 










60 


Ser 


Arg 


He 


Trp 


Leu 










75 


Arg 


Ala 


His 


Asp 


Val 










90 


Cys 


Leu 


Asn 


Phe 


Ala 










105 


Thr 


Thr 


Cys 


Pro 


Lys 










120 


Val 


Ala 


Phe 


Lys 


Ala 










135 


He 


Asp 


Val 


Glu 


Glu 
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140 










Thr 


He 


Val 


Glu 


Ala 
155 


lie 


Phe 


Thr 


Glu 


Tyr 


Met 


Asp 


Glu 


Glu 
170 


Glu 


Ser 


Met 


Phe 


Ala 


Ser 


Met 


Ala 


Glu 
185 


Gly 


Met 


Leu 


Leu 


Phe 


Gly 


His 


Thr 


Tyr 


Asp 


Phe 


Asp 


Gly 










200 








Trp 


Ser 


Tyr 















145 
Glu 
160 
Gly 
175 
Pro 
190 
Asp 
205 



150 
Phe 
165 
Leu 
180 
Gin 
195 
Leu 
210 



215 



(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 953 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napu 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 
CD) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: bnCBFB gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequenci 

(D) OTHER INFORMATION: 



(X) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO*60- 
ACCGCTCGAG CAACAATGAA CACATTCCCT GCTTCCACTG AAATGGTTGG 

CTCCGAGAAC GAGTCTCCGG TTACTACGGT AGCAGGAGGT GATTATTATC 

CCATGTTGGC GGCAAGCTGT CCGAAGAAGC CAGCGGGTAG GAAGAAGTTT 

CAGGAGACAC GTCACCCCAT TTACCGAGGA GTTCGTCTGA GAAAGTCAGG 

TAAGTGGGTG TGTGAAGTGA GGGAACCAAA CAAGAAATCT AGAATTTGGC 
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TTGGAACTTT 


CAAAACAGCT 


G AG AT n n p a 




CUACb i GGC 1 


300 


GCCCTAGCCC 


TCCGTGGAAG 


*^ VJ VJ O \j \^ 1 Ov^ 


i i i A i O 


CUtjAC i uGGC 


350 


TTCGCGGCTC 


CGCATCCCGG 


AG AC AAC CTQ 






400 


CTGCTGCTGA 


AGCCGCATTG 


GCTTTTc?Ar;r; 


PTPAPAZiZi 


i bAi (j i bACG 


450 


ATGGAGGAGA 


CGATGGCGGT 






A i GAL AC GAG 


500 


GACAGATCAT 


GGCATGAACA 






GCTTCTCAGG 


550 


CTGAGGTGAA 


TGACACGACG 






GoAGGAGACA 


600 


ATGGTGGAGG 


CTGTTTTTAC 


TGGGGAArAA 




i iAACATGGC 


650 


GAAGGAGTCG 


ACGGTGGAGG 


CTGCTGTTf^T 


TAPPPZirrrvn 


CCbAGCAAAG 


700 


GATCTTACAT 


GGACGAGGAG 


TGGATGPTPr; 




Cxi GTTGGCT 


750 


GATATGGCGG 


AAGGGATGCT 


J. i. vj w W \J \_< 




AATGuGGACA 


800 






ATGCGGACAT 


GAACCTCTGG 


AGTTATTAAT 


850 


ACTCGTATTT 


TTAAAATTAT 


TTATTGTGCA 


ATAATTTTTT 


ATCGAATTTC 


900 


GAATTCTGCC 


TTTAATTTTT 


AATGGGGATC 


TTTATTTGCC 


AAAAAAAAAA 


950 


AAA 










953 



(2) INF0R^4ATI0N FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 277 

(3) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 
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(A) NAME/KEY: BNCBF8-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(X) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO : 


61 : 








Met 


Asn 


Thr 


Phe 


Pro 


Ala 


Ser 


Thr 


Glu 


Met 


Val 


Gly 


Ser 


Glu 


Asn 


Glu 








5 










10 








15 


Ser 


Pro 


Val 


Thr 


Thr 


Val 


Ala 


Gly 


Gly 


Asp 


Tyr 


Tyr 


Pro 


Met 










20 










25 






30 


Leu 


Ala 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 










35 










40 








45 


Gin 


Glu 


Thr 


Arg 


His 


Pro 


He 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 










50 










55 








60 


Ser 


Gly 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 










65 










70 






75 


Arg 


He 


Trp 


Leu 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


Met 


Ala 


Ala 


Arg 


Ala 








80 










85 










90 


His 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 


Leu 








95 










100 








105 


Asn 


Tyr 


Ala 


Asp 


Ser 


Ala 


Ser 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 










110 










115 








120 


Thr 


Cys 


His 


Lys 


Asp 


He 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 










125 










130 










135 


Ala 


Phe 


Glu 


Ala 


Glu 


Lys 


Ser 


Asp 


Val 


Thr 


Met 


Glu 


Glu 


Thr 


Met 


Ala 








140 










145 










150 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 








155 










160 








165 


Met 


Asn 


Met 


Glu 


Glu 


Ala 


Thr 


Ala 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 










170 










175 










180 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 


Val 


Asp 


Met 


Glu 


Glu 


Thr 










185 










190 










195 


Met 


Val 


Glu 


Ala 


Val 


Phe 


Thr 


Gly 


Glu 


Gin 


Ser 


Glu 


Gly 


Phe 


Asn 










200 










205 








210 


Met 


Ala 


Lys 


Glu 


Ser 


Thr 


Val 


Glu 


Ala 


Ala 


Val 


Val 


Thr 


Glu 


Glu 










215 










220 










225 


Pro 


Ser 


Lys 


Gly 


Ser 


Tyr 


Met 


Asp 


Glu 


Glu 


Trp 


Met 


Leu 


Glu 


Met 










230 










235 








240 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Glu 


Gly 


Met 


Leu 


Leu 


Pro 


Pro 


Pro 








245 










250 










255 


Ser 


Val 


Gin 


Trp 


Gly 


Gin 


Asn 


Asp 


Asp 


Phe 


Glu 


Gly 


Asp 


Ala 










260 










265 






270 


Asp 


Met 


Asn 


Leu 


Trp 


Ser 


Tyr 
























275 










280 













(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

{ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
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(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 
CE) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 
C I ) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: bnCBF9 gene 
(E) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFOE^MATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CTAGTGATTA CCGCTCGAGC AACAATGAAC ACATTCCCTG CTTCCACTGA 50 

AATGGTTGGC TCCGAGAACG AGTCTCCGGT TACTACGGTA GCAGGAGGTG 100 

ATTATTATCC CATGTTGGCG GCAAGCTGTC CGAAGAAGCC AGCGGGTAGG 150 

AAGAAGTTTC AGGAGACACG TCACCCCATT TACCGAGGAG TTCGTCTGAG 200 

AAAGTCAGGT AAGTGGGTGT GTGAAGTGAG GGAACCAAAC AAGAAATCTA 2 50 

GAATTTGGCC CGGAACTTTC AAAACAGCTG AGATGGCAGC TCGTGCTCAC 300 

GACGTCGCTG CCCTAGCCCT CCGTGGAAGA GGCGCCCGCC TCAATTATGC 350 

GGACTCAGCT TGGCGGCTCC GCATCCCGGA GACAACCTGC CACAAGGATA 4 00 

TCCAGAAGGC TGCTGCTGAA GCCGCATTGG CTTTTGAGGC TGAGAAAAGT 4 50 

GATGTGACGA TGCAAAATGG CCTGAACATG GAGGAGACGA CGGCGGTGGC 500 

TTCTCAGGCT GAAGTGAATG ACACGACGAC AGAACATGGC ATGAACATGG 550 

AGGAGGCAAC GGCAGTGGCT TCTCAGGCTG AGGTGAATGA CACGACGACG 600 

GATCATGGCG TAGACATGGA GGAGACAATG GTGGAGGCTG TTTTTACTGG 65 0 

GGAACAAAGT GAAGGGTTTA ACATGGCGAA GGAGTCGACG GTGGAGGCTG 7 00 

CTGTTGTTAC GGAGGAACCG AGCAAAGGAT CTTACATGGA CGAGGAGTGG 7 50 

ATGCTCGAGA TGCCGACCTT GTTGGCTGAT ATGGCGGAAG GGATGCTTTT 8 00 

GCCGCCGCCG TCCGTACAAT GGGGACAGAA TGATGACTTC GAAGGAGATG 8 50 

CGCACATGAA CCTCTGGAGT TATTAAGGAT CCGCGAATC 8 89 
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(2) INFORMATION FOR SEQ ID MO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Brassica 


napus 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


O) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 


(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




{!) 


ORGANELLE: N/A 





{vii} IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BNCBF9-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequenci 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


Met 


Asn 


Thr 


Phe 


Pro 


Glu 


Ser 


Pro 


Val 


5 

Thr 










20 


Leu 


Ala 


Ala 


Ser 


Cys 










35 


Gin 


Glu 


Thr 


Arg 


His 










50 


Ser 


Gly 


Lys 


Trp 


Val 










65 


Arg 


lie 


Trp 


Pro 


Gly 










80 


Ala 


His 


Asp 


Val 


Ala 










95 


Leu 


Asn 


Tyr 


Ala 


Asp 










110 


Thr 


Cys 


His 


Lys 


Asp 










125 


Ala 


Phe 


Glu 


Ala 


Glu 










140 


Asn 


Met 


Glu 


Glu 


Thr 



DESCRIPTION: SEQ ID 



Ala 


Ser 


Thr 


Glu 


Met 










10 


Thr 


Val 


Ala 


Gly 


Gly 










25 


Pro 


Lys 


Lys 


Pro 


Ala 










40 


Pro 


He 


Tyr 


Arg 


Gly 










55 


Cys 


Glu 


Val 


Arg 


Glu 










70 


Thr 


Phe 


Lys 


Thr 


Ala 


Ala 


Leu 


Ala 


Leu 


85 
Arg 










100 


Ser 


Ala 


Trp 


Arg 


Leu 










115 


He 


Gin 


Lys 


Ala 


Ala 










130 


Lys 


Ser 


Asp 


Val 


Thr 










145 


Thr 


Ala 


Val 


Ala 


Ser 



NO: 63: 



Val 


Gly 


Ser 


Glu 


Asn 










15 


Asp 


Tyr 


Tyr 


Pro 


Met 










30 


Gly 


Arg 


Lys 


Lys 


Phe 










45 


Val 


Arg 


Leu 


Arg 


Lys 










60 


Pro 


Asn 


Lys 


Lys 


Ser 










75 


Glu 


Met 


Ala 


Ala 


Arg 










90 


Gly 


Arg 


Gly 


Ala 


Arg 










105 


Arg 


He 


Pro 


Glu 


Thr 










120 


Ala 


Glu 


Ala 


Ala 


Leu 










135 


Met 


Gin 


Asn 


Gly 


Leu 










150 


Gin 


Ala 


Glu 


Val 


Asn 
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155 






Asp 


Thr 


Thr 


Thr 


Glu 


His 


Gly 










170 




Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 










185 






Val 


Asp 


Met 


Glu 


Glu 


Thr 


Met 










200 






Gin 


Ser 


Glu 


Gly 


Phe 


Asn 


Met 










215 






Ala 


Val 


Val 


Thr 


Glu 


Glu 


Pro 










230 






Glu 


Trp 


Met 


Leu 


Glu 


Met 


Pro 










245 






Gly 


Met 


Leu 


Leu 


Pro 


Pro 


Pro 










260 






Asp 


Phe 


Glu 


Gly Asp 


Ala 


His 










275 







(2) INFORMATION FOR SEQ ID 







160 










X DO 


Met 


Asn 


Met 


Glu 


Glu 


Ala 


Thr 








175 










18 0 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 






190 










X ^ 


Val 


Glu 


Ala 


Val 


Phe 


Thr 


Gly 


Glu 


Ala 




205 










210 


Lys 


Glu 


Ser 


Thr 


Val 


Glu 


Ala 






220 










22 5 


Ser 


Lys 


Gly 


Ser 


Tyr 


Met 


Asp 


Glu 


Thr 




235 








240 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Glu 


Ser 




250 










255 


Val 


Gin 


Trp 


Gly 


Gin 


Asn 


Asp 






265 










270 


Met 


Asn 


Leu 


Trp 


Ser 


Tyr 










280 










285 



NO: 64 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: boCBFl gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencina 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO*64- 
CACCCTATCT ACCGGGGAGT TCGCCTGAGA AAGTCAGGTA AGTGGGTGTG 50 

TGAAGTGAGG GAGCCAAACA AGAAATCTAG GATTTGGCTT GGAACTTTCA 100 
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AAACCGCAGA 


GATCGCTGrT 




ACGTTGCCGC 


CTTAGCCCTC 


150 


CGTGGAAGAG 


CGGCCTGTCT 




CjACTCGGCTT 


GGCGGCTCCG 


200 


TATCCCGGAG 


ACAACTTGCG 






GCTGCTGAAG 


250 


CTGCGTTGGC 


TTTTGGGGCC 




i L- G AC 


GAATGATCAA 


300 


GGCATGAACA 


TGGAGGAGAT 




i i LTCAGG 


CTGAGGTGAG 


350 


CGACACGACG 


ACATATCATC^ 




G GAG GAG ACT 


ATGGTGGAGG 


400 




i ort,LjvjH/\L,rt.U 


AGAGAAGGGT 


TTTACTTGGC 


GGAGGAGACG 


450 


ACGGTGGAGG 


GTGTTGTTAC 


GGAGGAACAG 


AGCAAAGGGT 


TTTATATGGA 


500 


CGAGGAGTGG 


ACGTTCGGGA 


TGCAGTCCTT 


TTTGGCCGAT 


ATGGCTGAAG 


550 


GCATGCTCTT 


TCC 








563 



(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

{ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Brassica 


oleracea 


(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BOCBFl-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65- 

is Pro He Tyr Arg Gly Val Arg Leu Arg Lys Ser Gly Lys Trp 
5 10 15 
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Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 










20 






o-L y 


i nr 


D K 

r ne 


Lys 


Tnr 


Ala 


Glu 










35 






Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 










50 




Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 










65 




Asp 


He 


Gin 


Lys 


Ala 
80 


Ala 


Ala 


Glu 


Lys 


Ser 


Asp 


Thr 


Thr 


Thr 










95 






Glu 


Met 


Thr 


Vai 


Val 


Ala 


Ser 










110 






Thr 


Tyr 


His 


Gly 


Leu 


Asp 


Met 










125 






Phe 


Ala 


Glu 


Glu 


Gin 


Arg 


Glu 










140 






Thr 


Val 


Glu 


Gly 


Val 


Val 


Thr 










155 






Met 


Asp 


Glu 


Glu 


Trp 


Thr 


Phe 










170 






Met 


Ala 


Glu 


Gly 


Met 


Leu 


Phe 










185 







Asn 


Lys 


Lvs 
25 


Ser 


Arg 


He 




Leu 
30 


He 


Ala 


Ala 
40 


Ara 


Ala 


His 


Asp 


V^ 1 
45 


Arg 


Ala 


Ala 
55 


Cys 


Leu 


Asn 


Phe 


A 1 A 
60 


He 


Pro 


Glu 
70 


Thr 


Thr 


Cys 


Ala 


Lys 
75 


Glu 


Ala 


Ala 


Leu 


Ala 


Phe 


Gl y 


t\±, a. 






85 








90 


Asn 


Asp 


Gin 
100 


Glv 


Met 


Asn 


Met 


o X u 

105 


Gin 


Ala 


Glu 
115 


Val 


Ser 


Asp 


Th r 


Th T 

120 


Glu 


Glu 


Thr 

J. o u 


Met 


Val 


Glu 




V d _L 

135 


Gly 


Phe 


Tyr 
145 


Leu 


Ala 


Glu 


Glu 


Thr 
150 


Glu 


Glu 


Gin 
160 


Ser 


Lys 


Gly 


Phe 


Tyr 
165 


Gly 


Met 


Gin 
175 


Ser 


Phe 


Leu 


Ala 


Asp 
180 


Pro 

















(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 533 

(3) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 
(?) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: boCBF2 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



63/101 



wo 99/38977 



PCT/US99/01895 



(x) PUBLICATION INFORMATION; 



(xi) 

7\ 7\ 7\ TV m TV TV 

G AAAC AT AG A 


SEQUENCE DESCRIPTION: 
TCTTTGTACT TACTATACTT 


SEQ ID NO; 
CACCTTATCC 


66: 

AGTTTTATTT 


50 


TTTTATTTAT 


AAAGAGTTTT 


CAACAATGAC 


CTCATTTTCT 


ACCTTTTCTG 


100 


AAC iGl TGGG 


CTCCGAGCAT 


GAGTCTCCGG 


TTACATTAGG 


CGAAGAGTAT 


150 


TGTCCGAAGC 


TGGCCGCAAG 


CTGTCCGAAG 


AAACCAGCCG 


GCCGGAAGAA 


200 


G 1 TTCGAGAG 


ACGCGTCACC 


CAGTTTACAG 


AGGAGTTCGT 


CTGAGAAACT 


250 


GAGGTAAG i G 


GGTGTGTGAA 


GTGAGGGAGC 


CAAACAAGAA 


ATCTAGGATT 


300 


i GGCTCGGTA 


CTTTCCTAAC 


AGCCGAGATC 


GCAGCCCGTG 


CTCACGACGT 


350 


CGCCGCCATA 


GCCCTCCGCG 


GCAAATCAGC 


TTGTCTCAAT 


TTTGCCGACT 


400 


CCGCTTGGCG 


GCTCCGTATC 


CCGGAGACAA 


CATGCCCCAA 


GGAGATTCAG 


450 


AAGGCGGCTG 


CTGAAGCCGC 


GGTGGCTTTT 


AAGGCTGAGA 


TAAATAATAC 


500 


GACGGCGGAT 


CACGGCCTCG 


ACATGGAAGA 


GAC 




533 


(2) INFORMATION FOR SEQ ID NO: 67 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: B0CBF2-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 
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(X) PUBLICATION INFORI^TION: 





(xi) 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


67 : 








Met 


Thr 


Ser 


Phe 


OCX 


L nr 


P h o 

IT ne 


Q o V* 

oer 


oiu 


Leu 


Leu 


Gly 


Ser 


Glu 


His 




















iU 








15 


Glu 


Ser 


Pro 


Val 


Thr 


i-ieu 


o X y 


V3 X u 


O J. u 


Tyr 


Cys 


Pro 


Lys 


Leu 


Ala 




















Z D 








30 


Ala 


Ser 


Cy s 


Pro 


Lys 


L y s 


C) y~ r\ 

tr X o 


i-ix a 




Arg 


Lys 


Lys 


Phe 


Arg 


Glu 




















/I ri 
4 U 








45 


Thr 




n X o 


P TO 


V ct X 


T* \7 

i yr 


Ar g 


tjX y 


V ax 


Arg 


Leu 


Arg 


Asn 


Ser 


Gly 










SO 
•J \j 


















60 


Lys 


T rn 


Va 1 


Cy s 


Glu 


Val 


rtx y 


O X U 


£r r O 


Asn 


Lys 


Lys 


Ser 


Arg 


He 










65 










/ U 






75 


Trp 


Leu 


o X y 


i nr 


ir ne 


Leu 


rnr 


Ala 


Glu 


He 


Ala 


Ala 


Arg 


Ala 


His 










o U 










85 








90 


Asp 


Val 


Ala 


Ala 


He 


Ala 


Leu 


Arg 


Gly 


Lys 


Ser 


Ala 


Cys 


Leu 


Asn 










95 










100 








105 


Phe 


Ala 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 


Thr 


Cys 










110 










115 










120 


Pro 


Lys 


Glu 


He 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Val 


Ala 


Phe 










125 










130 










135 


Lys 


Ala 


Glu 


He 


Asn 


Asn 


Thr 


Thr 


Ala 


Asp 


His 


Gly 


Leu 


Asp 


Met 










140 










145 








150 


Glu 


Glu 




































155 























(2) INFORMJVTION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 887 

(3) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Sxngle 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: boCBF3 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 
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(D) OTHER INFORMATION: 
(x) PUBLICATION INFORMATION: 



(xi) 

ACTCAGCCTT 


SEQUENCE DESCRIPTION: 
ATCCAGTTTT TCTCAAAAGA 


SEQ ID NO: 
TTTTTCAACA 


68 : 

ATGAACACAT 


50 


TCCCTGCTTC 


CACTGAAATG 


GTTGGCTCCG 


AGAACGAGTC 


TCCGGTTACT 


100 


ACGGTAGTAG 


GAGGTGATTA 


TTATCCCATG 


rTGGCGGCAA 


GCTGTCCG7VA 


150 


GAAGCCAGCG 


GGTAGGAAGA 


AGTTTCAGGA 


GACACGTCAC 


CCCATTTACC 


200 


GAGGAGTTCG 


TCTGAGAAAG 


TCAGGTAAGT 


GGGTGTGTGA 


AGTGAGGGAA 


250 


CCAAACAAGA 


AATCTAGAAT 


TTGGCTTGGA 


ACTTTCAAAA 


CAGCTGAGAT 


300 


GGCAGCTCGT 


GCTCACGACG 


TGGCTGCCCT 


AGCCCTCCGT 


GGAAGAGGCG 


350 




TTATGCGGAC 


TCGGCTTGGC 


GGCTCCGCAT 


CCCGGAGACA 


400 




AGGATATCCA 


GAAGGCTGCT 


GCTGAAGCCG 


CATTGGCTTT 


450 


TGAGGPTGAG 


AAAAGTGATG 


TGACGATGGA 


GGAGACGATG 


GCGGTGGCTT 


500 




AGTGAATGAC 


ACGACGACAG 


ATCATGGCAT 


GAACATGGAG 


550 




CGGTGGCTTC 


TCAGGCTGAG 


GTGAATGACA 


CGACGACAGA 


600 


i v^ri J. o \j i r-i 


GACATGGAGG 


AGACGATGGT 


GGAGGCTGTT 


TTTACGGAGG 


650 


AACAAAGTGA 


AGGGTTCAAC 


ATGGCGGAGG 


AGTCGACGGT 


GGAGGCTGCT 


700 


GTTGTTACGG 


ATGAACTGAG 


CAAAGGATTT 


TACATGGACG 


AGGAGTGGAC 


750 


GTACGAGATG 


CCGACCTTGT 


TGGCTGATAT 


GGCGGCAGGG 


ATGCTTTTGC 


800 


CGCCACCATC 


TGTACAATGG 


GGACATAATG 


ATGACTTGGA 


AGGAGATGCG 


850 


GACATGAACC 


TCTGGAGTTA 


TTAATACTCG 






887 


(2) INFORMATION FOR SEQ ID NO: 69: 







(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 277 

{B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 
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(E) 
(F) 
(G) 
(H) 
(I) 



HAPLOTYPE: N/A 
TISSUE TYPE: N/A 



CELL TYPE 
CELL LINE 
ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: B0CBF3-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFOEIMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



Met 


Asn 


Thr 


Phe 


Pro 


Ala 


Ser 


Thr 


Giu 


Met 


Val 


Giy 


Ser 


Glu 


Asn 










5 










10 








15 


Glu 


Ser 


Pro 


Vai 


Thr 


Thr 


Val 


Val 


Gly 


Gly 


Asp 


Tyr 


Tyr 


Pro 


Met 










20 










25 










30 


Leu 


Ala 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 










35 










40 








45 


Gin 


Glu 


Thr 


Arg 


His 


Pro 


lie 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 










50 










55 








60 


Ser 


Gly 


Lys 


Trp 


Vai 


Cys 


Giu 


Val 


Arg 


Giu 


Pro 


Asn 


Lys 


Lys 


Ser 










65 










70 






75 


Arg 


lie 


Trp 


Leu 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


Met 


Ala 


Ala 


Arg 










80 










85 










90 


Ala 


His 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Giy 


Arg 


Giy 


Ala 


Cys 










95 










100 










105 


Leu 


Asn 


Tyr 


Ala 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


lie 


Pro 


Giu 


Thr 










110 










115 










120 


Thr 


Cys 


His 


Lys 


Asp 


lie 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 










125 










130 










135 


Ala 


Phe 


Glu 


Ala 


Giu 


Lys 


Ser 


Asp 


Vai 


Thr 


Met 


Glu 


Glu 


Thr 


Met 










140 










145 










150 


Ala 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 










155 










160 








165 


Giy 


Met 


Asn 


Met 


Glu 


Glu 


Ala 


Thr 


Ala 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 










170 










175 










180 


Vai 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Giy 


Vai 


Asp 


Met 


Glu 


Glu 


Thr 










185 










190 








195 


Met 


Vai 


Glu 


Ala 


Val 


Phe 


Thr 


Giu 


Giu 


Gin 


Ser 


Glu 


Giy 


Phe 


Asn 










200 










205 








210 


Met 


Ala 


Giu 


Glu 


Ser 


Thr 


Val 


Giu 


Ala 


Ala 


Val 


Vai 


Thr 


Asp 


Glu 










215 










220 








225 


Leu 


Ser 


Lys 


Gly 


Phe 


Tyr 


Met 


Asp 


Glu 


Glu 


Trp 


Thr 


Tyr 


Glu 


Met 










230 










235 








240 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Ala 


Gly 


Met 


Leu 


Leu 


Pro 


Pro 










245 










250 










255 


Pro 


Ser 


Val 


Gin 


Trp 


Giy 


His 


Asn 


Asp 


Asp 


Leu 


Giu 


Giy 


Asp 


Ala 










260 










265 








270 


Asp 


Met 


Asn 


Leu 


Trp 


Ser 


Tyr 



















275 280 



(2) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 



67/101 
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(A) LENGTH: 950 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DMA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE : No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE* 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: boCBF4 gene 

(B) LOCATION: 

l^] IDENTIFICATION METHOD: sequencing 
(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO*70- 
CTGAAAAGAA GATAAAAGAG AGAGAAATAA ATATCTTATC AAACCAGACA 

GAACAGAGAT CTTGTTACTT ACTATACTAC ACTCAGCCTT ATCCAGTTTT 

TCAAAAGAAG TTTTCAACTA TGAACTCAGT CTCTACTTTT TCTGAACTTC 

TTGGCTCTGA GAACGAGTCT CCGGTAGGTG GTGATTACTG TCCCATGTTG 

GCGGCGAGCT GTCCGAAGAA GCCGGCGGGT AGGAAGAAGT TTCGGGAGAC 

ACGTCACCCC ATTTACCGAG GAGTTCGCCT TAGAAAATCA GGTAAGTGGG 

TGTGTGAAGT GAGGGAACCA AACAAAAAAT CTAGGATTTG GCTCGGAACT 

TTCAAAACAG CTGAGATCGC AGCTCGTGCT CACGACGTCG CCGCCTTAGC 

TCTCCGTGGA AGAGGCGCCT GCCTCAACTT CGCCGACTCG GCTTGGCGGC 

TCCGTATCCC GGAGACAACC TGCGCCAAGG ATATCCAGAA GGCTGCTGCT 

GAAGCCGCAT TGGCTTTTGA GGCCGAGAAG AGTGATACCA CGACGAATGA 

TCATGGCATG AACATGGCTT CTCAGGCTGA GGTTAATGAC ACGACGGATC 

ATGGCCTGGA CATGGAGGAG ACGATGGTGG AGGCTGTTTT TACTGAGGAG 

68/101 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 

650 



BNSDOCID; <W0 9938977A2 J_> 



wo 99/38977 



PCT/US99/01895 



CAGAGAGACG 


GGTTTTACAT 






A^jiaCj i Cj i 1 CjtT 


700 


TCCGGAGGAA 


CAGATGAGCA 


AAGGGTTTTA 




T\ 'T 7\ rp*— rn 


750 


TCGGGATGCC 


GACCTTGTTG 


GCTGATATGG 




i U i 1 ACCG 


8 00 


CCGCCGTCCG 


TACAATGGGG 


ACATAATGAT 


GACTTCGAAG 


GAGATGCTGA 


850 


CATGAACCTC 


TGGAATTATT 


AGTACTCGTA 


TTTTTTTAAA 


TTATTTTTTG 


900 


AACGAATAAT 


ATTTTATTGA 


ATTCGGATTC 


TACCTGTTTT 


TTTAATGGAT 


950 


(2) IWrORMATION FOR SEQ ID NO : 7 1 : 







(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 250 
{3) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Protein 

Ciii) HYPOTHETICAL: No 

(iv; ANTI-SENSE: No 

(V) FRAGMENT TYPE: 

(vi; ORIGINAL SOURCE: 

{A) ORGANISM: Brassica oleracea 

(3) STRAIN: 

iC) INDIVIDUAL ISOLATE: N/A 

■D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 
;H) cell LINE: N/A 
(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 30CBF4-PEP 

(3) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 





{xi; 


SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 


71: 








Met 


Asn 


Ser 


Val 


Ser 


Thr 


Phe 


Ser 


Glu 


Leu 


Leu 


Gly 


Ser 


Glu 


Asn 


Glu 








5 










10 








15 


Ser 


Pro 


Val 


Giy 


Gly 


Asp 


Tyr 


Cys 


Pro 


Met 


Leu 


Ala 


Ala 


Ser 


Cys 








20 










25 










30 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 


Arg 


Glu 


Thr 


Arg 


His 




He 




35 










40 










45 


Pro 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 


Ser 


Gly 


Lys 


Trp 


Val 








50 










55 






60 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 


Arg 


He 


Trp 


Leu 
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65 

Gly Thr Phe Lys Thr Ala Glu lie Ala 
80 

Ala Ala Leu Ala Leu Arg Gly Arg Gly 
95 

Asp Ser Ala Trp Arg Leu Arg lie Pro 
110 

Asp lie Gin Lys Ala Ala Ala Glu Ala 
125 

Glu Lys Ser Asp Thr Thr Thr Asn Asp 
140 

Ser Gin Ala Glu Val Asn Asp Thr Thr 
155 

Glu Glu Thr Met Val Glu Ala Val Phe 
170 

Gly Phe Tyr Met Ala Glu Glu Thr Thr 
185 

Glu Glu Gin Met Ser Lys Gly Phe Tyr 
200 

Phe Gly Met Pro Thr Leu Leu Ala Asp 
215 

Leu Pro Pro Pro Ser Val Gin Trp Gly 
230 

Gly Asp Ala Asp Met Asn Leu Trp Asn 
245 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 877 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: boCBF5 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



70/101 
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70 

Ala Arg Ala 
85 

Ala Cys Leu 
100 

Glu Thr Thr 
115 

Ala Leu Ala 
130 

His Gly Met 
145 

Asp His Gly 
160 

Thr Glu Glu 
175 

Val Glu Gly 
190 

Met Asp Glu 
205 

Met Ala Ala 
220 

His Asn Asp 

235 

Tyr 

250 



75 

His Asp Val 
90 

Asn Phe Ala 
105 

Cys Ala Lys 
120 

Phe Glu Ala 
135 

Asn Met Ala 
150 

Leu Asp Met 
165 

Gin Arg Asp 
180 

Val Val Pro 
195 

Glu Trp Met 

210 

Gly Met Leu 
225 

Asp Phe Glu 
240 



J 1 
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(x) PUBLICATION INFORiyiATION : 



(xi) 
ACCGCTCGAG 


SEQUENCE DESCRIPTION: 
CAACAATGAA CACATTCCCT 


SEQ ID NO: 


72: 

AAATGGTTAG 


50 


CTCCGAGAAC 


GAGTCTCCGG 


TTACTACGGT 




bAl i ATTATC 


100 


CCATGTTGGC 


GGCAAGCTGT 


CCGAAGAAGC 




CjAAbAAtjl TT 


150 


CAGGAGACAC 


GTCACCCCAT 


TTACCGAGGA 




bAAAb i bAbG 


200 


TAAGTGGGTG 


TGTGAAGTGA 


GGGAACTAAA 




AbAAi i i bbC 


250 


TTGGAACTTT 


CAAAACAGCT 


GAGATGGCAG 




bbAbb i bbCT 


300 


GCCCTAGCCC 


TCCGTGGAAG 


AGGCGCCTGC 




bbbAb rCGGC 


350 


TTGGCGGCTC 


CGCATCCCGG 


AGACAACCTG 




ATCCAGAAGG 


400 


CTGCTGCTGA 


AGCCGCATTG 


GCTTTTGAGG 




TGATGCGACG 


450 


AT G C AAAAT G 


GCCTGAACAT 


GGAGGAGACG 


Vj (_ Cj (j C Cjj G 


CTTCTCAGAC 


500 


TGAAGTGAGT 


GACACGACGA 


CAGATCATGG 


*w A 1 bAAL A J, Cj 


GAGGAGACAA 


550 


CGGCGGTGGr 


TTCTCAGGCT 


GAGGTGAATG 


At^AUtjACbAC 


AGATCATGGC 


600 


GTAGACATGG 


AG GAG AC GAT 


GGTGGAGGCT 


m m m rn (Ti TV pp /— i 

b i i i i i AO r G 


TV TV T\ ^ ^ 

AGGAACAAAG 


650 




AACATGGCGA 


AGGAGTCGAC 


T\ rt> 

GGCGGAGGCT 


GCTGTTGTTA 


700 


CGGAGGAACT 


GAGCAAAGGA 


GTTTACATGG 


ACGAGGAGTG 


GACGTACGAG 


750 


ATGCCGACCT 


TGTTGGCTGA 


TATGGCGGCA 


GGGATGCTTT 


TGCCGCCACC 


800 


ATCTGTACAA 


TGGGGACATA 


ATGATGACTT 


GGAAGGAGAT 


GCGGACATGA 


850 


ACCTACTGGA 


GTTATTAAGG 


ATCCGCG 






877 


(2) INFORMATION FOR SEQ ID NO: 73 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 

(B) TYPE: Amino Acid 

{C) STRANDEDNESS : Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica oleracea 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 
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(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: B0CBF5-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


73 : 








Met 


fl c: n 
r\j L i 




Ph o 


irLO 


AX a 


be r 


Thr 


Glu 


Met 


Val 


Ser 


Ser 


Glu 


Asn 


Glu 








c 
D 










10 










15 


C r- 

oer 


Pro 


vai 


i nr 


Thr 


Val 


Val 


Gly 


Gly 


Asp 


Tyr 


Tvr 


Pro 


Met 


Leu 




A 1 ^ 

r\.l. a. 




z u 










25 








30 


oe r 


^ys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


L vs 


Phe 


Gin 








-J D 










40 






45 


Glu 


Thr 


Arg 


His 


XT X U 


X J. e 


Tyr 


Arg 


Cjly 


Val 


Arg 


Leu 


Arg 


Lvs 




















55 








60 


Ser 


Gly 


Ly s 


i rp 


V cl J_ 


u-y s 


L3± u 


val 


Arg 


Glu 


Leu 


Asn 


Lys 


Lys 


Ser 










O D 










70 






75 


Arg 


He 




Leu 




i Ilx 


irne 


Lys 


i nr 


Ala 


Glu 


Met 


Ala 


Ala 


Arq 


Ala 


His 






O U 










85 










90 


Asp 


Val 


<rt. J- Cl 


A 1 ;q 

rt.X Cl 


Leu 


Axa 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cvs 




















100 








105 


Leu 


Asn 


Tyr 


Ai a 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 


Thr 








110 










115 










120 


Cys 


His 


Lys 


Asp 


He 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 










125 










130 










135 


Ala 


?he 


Glu 


Ala 


Glu 


Lys 


Ser 


Asp 


Ala 


Thr 


Met 


Gin 


Asn 


Gly 


Leu 










140 










145 










150 


Asn 


Met 


Glu 


Glu 


Thr 


Thr 


Ala 


Ala 


Ala 


Ser 


Gin 


Thr 


Glu 


Val 


Ser 










155 










160 










165 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 


Met 


Asn 


Met 


Glu 


Glu 


Thr 


Thr 


Ala 










170 










175 










180 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 










185 










190 








195 


Val 


Asp 


Met 


Glu 


Glu 


Thr 


Met 


Val 


Glu 


Ala 


Val 


Phe 


Thr 


Glu 


Glu 


Gin 








200 










205 










210 


Ser 


Glu 


Gly 


Phe 


Asn 


Met 


Ala 


Lys 


Glu 


Ser 


Thr 


Ala 


Glu 


Ala 


Ala 








215 










220 










225 


Val 


Val 


Thr 


Glu 


Glu 


Leu 


Ser 


Lys 


Gly 


Val 


Tyr 


Met 


Asp 


Glu 


Glu 








230 










235 








240 


Trp 


Thr 


Tyr 


Glu 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Ala 


Gly 








245 










250 








255 


Met 


Leu 


Leu 


Pro 


Pro 


Pro 


Ser 


Val 


Gin 


Trp 


Gly 


His 


Asn 


Asp 










260 










265 








270 


Asp 


Leu 


Glu 


Gly 


Asp 


Ala 


Asp 


Met 


Asn 


Leu 


Leu 


Glu 


Leu 


Leu 


Arg 










275 










280 










285 


He 


Arg 





























(2) INFORMATION FOR SEQ ID NO: 74: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 374 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: brCBFl gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencinq 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO • 7 4 * 
CATCCCATTT ACAGGGGGGT TCGTTTAAGA AAGTCAGGTA AGTGGGTGTG 

TGAAGTGAGG GAACCAAACA AGAAATCTAG GATTTGGCTC GGAACTTTCA 

AAACCGCTGA GATCGCTGCT CGTGCTCACG ACGTTGCTGC CTTAGCCCTC 

CGCGGGAGAG GCGCCTGCCT CAACTTCGCC GACTCGGCTT GGCGGCTCCG 

TATCCCGGAG ACAACCTGCG CCAAGGACAT CCAGAAGGCG GCTGCTGAAG 

CTGCATTGGC TTTTGAGGCC GAGAAGAGTG ATCATGGCAT GAACATCAAG 

AATACTACGG CGGTGGTTTC TCAGGTTGAG GTGAATGACA CGACGACGGA 

CCACGGCTTG GACATGGAGG AGAC 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 124 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

73/101 



50 

100 

150 

200 

250 

300 

350 

374 
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(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 



(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Brassica 


rapa 


{B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: BRCBFl-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 



(xi; SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



His 


Pro 


He 


Tyr 


Arg 
5 


Gly 


Val 


Val 


Cys 


Giu 


Val 


Arg 


Glu 


Pro 










20 






Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 










35 






Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 










50 






Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 










65 






Asp 


He 


Gin 


Lys 


Ala 


Ala 


Ala 










80 






Glu 


Lys 


Ser 


Asp 


His 


Gly 


Met 










95 






Val 


Ser 


Gin 


Val 


Glu 


Val 


Asn 










110 






Asp 


Met 


Glu 


Glu 









Arg 


Leu 


Arg 


Lys 


Ser 


Gly 


Lys 


Trp 






10 










15 


Asn 


Lys 


Lys 


Ser 


Arg 


He 


Trp 


Leu 






25 








30 


He 


Ala 


Ala 


Arg 


Ala 


His 


Asp 


Val 






40 








45 


Arg 


Gly 


Ala 


Cys 


Leu 


Asn 


Phe 


Ala 






55 










60 


He 


Pro 


Glu 


Thr 


Thr 


Cys 


Ala 


Lys 






70 










75 


Glu 


Ala 


Ala 


Leu 


Ala 


Phe 


Glu 


Ala 






85 










90 


Asn 


He 


Lys 


Asn 


Thr 


Thr 


Ala 


Val 






100 










105 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 


Leu 






115 








120 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 884 
(3} TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE; DNA 



(iii) HYPOTHETICAL: No 
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(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 
{ I ) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: brCBF2 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76- 

TACACTCAGC CTTATCCAGT TTTTTTCAAA AGACTTTTCA ACAATGAACA 50 

CATTCCCTGC GTCCACTGAA ATGGTTGGCT CCGAGAACGA GTCTCCGGTT 100 

ACTACGGTAG CAGGAGGTGA TTATTATCCC ATGTTGGCGG CAAGCTGTCC 150 

GAAGAAGCCA GCGGGTAGGA AGAAGTTTCA GGAGACACGT CACCCCATTT 200 

ACCGAGGAGT TCGTCTGAGA AAGTCAGGTA AGTGGGTGTG TGAAGTGAGG 250 

GAACCAAACA AGAAATCTAG AATTTGGCTT GGAACTTTCA AAACAGCTGA 300 

GATGGCAGCT CGTGCTCACG ACGTCGCTGC CCTAGCCCTC CGTGGAAGAG 350 

GCGCCTGCCT CAATTATGCG GACTCGGCTT GGCGGCTCCG CATCCCGGAG 4 00 

ACAACCTGCC ACAAGGATAT CCAGAAGGCT GCTGCTGAAG CCGCATTGGC 4 50 

TTTTGAGGCT GAGAAAAGTG ATGTGACGAT GCAAAATGGC CTGAACATGG 500 

AGGAGATGAC GGCGGTGGCT TCTCAGGCTG AAGTGAATGA CACGACGACA 550 

GAACATGGCA TGAACATGGA GGAGGCAACG GCAGTGGCTT CTCAGGCTGA 600 

GGTGAATGAC ACGACGACGG ATCATGGCGT AGACATGGAG GAGACAATGG 650 

TGGAGGCTGT TTTTACTGAG GAACAAAGTG AAGGGTTTAA CATGGCGAAG 7 00 

GAGTCGACGG TGGAGGCTGC TGTTGTTACG GAGGAACCGA GCAAAGGATC 7 50 

TTACATGGAC GAGGAGTGGA TGCTCGAGAT GCCGACCTTG TTGGCTGATA 8 00 

TGGCGGAAGG GATGCTTTTG CCGCCGCCGT CCGTACAATG GGGACAGAAT 850 
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GATGACTTCG AAGGAGATGC TGACATGAAC CTCT 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 

(B) TYPE: Amino Acid 

CO STRANDEDNESS: Single 
CD) TOPOLOGY: Linear 

Cii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
Cv) FRAGMENT TYPE: 
Cvi) ORIGINAL SOURCE: 



CA) 


ORGANISM: Brass ica 


rapa 


(B) 


STRAIN: 


(C) 


INDIVIDUAL ISOLATE: 


N/A 


CD) 


DEVELOPMENTAL STAGE: 


N/A 


CE) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




CG) 


CELL TYPE: N/A 




CH) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





Cvii) IMMEDIATE SOURCE: N/A 
Cviii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BRCBF2-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

Cx) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



Met 


Asn 


Thr 


Phe 


Pro 


Ala 


Ser 


Thr 


Glu 


Met 


Val 


Gly 


Ser 


Glu 


Asn 


Glu 








5 










10 








15 


Ser 


Pro 


Val 


Thr 


Thr 


Val 


Ala 


Gly 


Gly 


Asp 


Tyr 


Tyr 


Pro 


Met 




Ala 






20 










25 








30 


Leu 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 


Gin 








35 










40 






45 


Glu 


Thr 


Arg 


His 


Pro 


He 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 


Ser 


Gly 






50 










55 








60 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 




He 






65 










70 






75 


Arg 


Trp 


Leu 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


Met 


Ala 


Ala 


Arg 


Ala 


His 






80 










85 










90 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly Ala 


Cys 


Leu 








95 










100 










105 


Asn 


Tyr 


Ala 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 


Thr 








110 










115 








120 


Cys 


His 


Lys 


Asp 


He 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 


Ala 








125 










130 










135 


Phe 


Glu 


Ala 


Glu 


Lys 


Ser 


Asp 


Val 


Thr 


Met 


Gin 


Asn 


Gly 


Leu 
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140 










145 










150 


Asn 


Met 


Glu 


Glu 


Met 

155 


Thr 


Ala 


Val 


Ala 


Ser 
160 


Gin 


Ala 


Glu 


Val 


Asn 
165 


Asp 


Thr 


Thr 


Thr 


Glu 
170 


His 


Gly 


Met 


Asn 


Met 
175 


Glu 


Glu 


Ala 


Thr 


Ala 

180 


Val 


Ala 


Ser 


Gin 


Ala 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 










185 










190 








195 


Val 


Asp 


Met 


Glu 


Glu 
200 


Thr 


Met 


Val 


Glu 


Ala 
205 


Val 


Phe 


Thr 


Glu 


Glu 
210 


Gin 


Ser 


Glu 


Gly 


Phe 
215 


Asn 


Met 


Ala 


Lys 


Glu 
220 


Ser 


Thr 


Val 


Glu 


Ala 
225 


Ala 


Val 


Val 


Thr 


Glu 


Glu 


Pro 


Ser 


Lys 


Gly 


Ser 


Tyr 


Met 


Asp 


Glu 










230 










235 








240 


Glu 


Trp 


Met 


Leu 


Glu 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Glu 










245 










250 








255 


Gly 


Met 


Leu 


Leu 


Pro 
260 


Pro 


Pro 


Ser 


Val 


Gin 
265 


Trp 


Gly 


Gin 


Asn 


Asp 
270 


Asp 


Phe 


Glu 


Gly 


Asp 
275 


Ala 


Asp 


Met 


Asn 


Leu 
280 













(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 
Ciii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(3) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 
{ I ) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: brCBF3 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
ACACTCAGCC TTATCCAGTT TTCAAAAAAA GTATTCAACG ATGAACTCAG 50 
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TCTCTACTTT 


TTCTGAACTG 


CTCTGCTCCG 




i L^Cvjtj i i AAl 


100 


ACGGAAGGTG 


GTGATTACAT 


TTTGGCGGCG 


AGCTGTCCCA 


APAAAPr^Trr* 


15 0 


TGGTAGGAAG 


AAGTTTCAGG 


AGACACGCCA 


CCCCATTTAC 


APAPPAPTTP 


O A O 


GTCTGAGGAA 


GTCAGGTAAG 


TGGGTGTGTG 


AAGTGAGGGA 


APPAAZir'Zizir' 


2 5 0 


AAATCTAGAA 


TTTGGCTCGG 


AACTTTCAAA 


ACAGCTGAGA 




r\ r\ 


TGCTCACGAC 


GTTGCCGCCT 


TAGCTCTCCG 




PPPTPrT'TT'Ti 


350 


ACTTCGCCGA 


CTCGGCTTGG 


CGGCTCCGTA 




(ji^Ut^ i VjCGCL 


4 00 


AAGG AT AT CC 


AGAAGGCTGC 


TGCTGAAGCC 






4 50 


GAAGAGTGAT 


ACCACGACGA 


ATGATCGTGG 


CATPAAP ATP 




500 


CGGCGGTGGC 


TTCTCCGGCT 


GAGTTGAATG 


ATAPPAPPPP 




5 5 0 


CTGGACATGG 


AGGAGACGAT 


GGTGGAGGCT 






600 


AGAAGGGTTT 


TACATGGCGG 


AGGAGACHAP 




u 1 1 tj i 1 CCGG 


650 


AGT7VACAGAT 


GAGCAAAGGG 


TTTTACATGG 


ACGAPPAPTP 




7 00 


ATGCCGAGGT 


TGTTGGCTGA 


TATGGCGGAA 


GGGATGCTTT 


TGCCGCCCCC 


750 


GTCCGTACAA 


TGGGGACATA 


ACGATGACTT 


CGAAGGAGAT 


GCTGACATGA 


800 


ACCTCT 










806 


(2) INFORMATION FOR SEQ ID NO : 7 9 : 







(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 204 
(3) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 
(£) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 
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(ix) FEATURE: 

(A) NAME/KEY: BRCBF3-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


Met 


Asn 




V d J_ 


O c X 










c 
0 


Glu 


Ser 


Pr o 


V d JL 


A c: n 












Ser 


o 


IT J- O 


Lys 


Lys 












Arg 


n X o 


IT 1. O 


X X tr 


Tyr 










D U 


Trp 


V a JL 


^ _y ij 


V3 X U 


V a X 










D D 


Leu 


\J3 J. y 


Th -r 


It I 


Lys 










o u 


Val 


Ala 


Ala 


Leu 


Ala 










95 


Ala 


Asp 


Ser 


Ala 


Trp 










110 


Lys 


Asp 


He 


Gin 


Lys 










125 


Ala 


Glu 


Lys 


Ser 


Asp 










140 


Glu 


Glu 


Thr 


Ser 


Ala 










155 


Thr 


Ala 


Asp 


His 


Gly 










170 


Val 


Phe 


Arg 


Glu 


Glu 










185 


Thr 


Thr 


Val 


Glu 


Gly 










200 



DESCRIPTION 


: SEQ ID 


Th V 

J. nr 


Dh ^ 

ir ne 


ber 


Glu 


Leu 










10 


Th r- 
1 Hx 


tjX u 


/"III 
(j 1 y 


i" 1 . . 
Gly 


Asp 










25 


r r o 


/\x a 


/" 1 » , 
Cjly 


Arg 


Lys 










4 0 


Arg 


tjiX y 


Val 


Arg 


Leu 










55 




oX u 


Pro 


Asn 


Lys 










70 


Th v 

i n r 


i\L a 


CjIu 


He 


Ala 










8 5 


Leu 


Arg 


Gly 


Arg 


Gly 










100 


Arg 


Leu 


Arg 


He 


Pro 










115 


Ala 


Ala 


Ala 


Glu 


Ala 










130 


Thr 


Thr 


Thr 


Asn 


Asp 










145 


Val 


Ala 


Ser 


Pro 


Ala 










160 


Leu 


Asp 


Met 


Glu 


Glu 










175 


Gin 


Arg 


Glu 


Gly 


Phe 










190 


Val 


Val 


Pro 


Glu 












205 



NO: 


79: 








Leu 


Cys 


Ser 


Glu 


Asn 










15 


Tyr 


He 


Leu 


Ala 


Ala 










30 


Lys 


Phe 


Gin 


Glu 


Thr 










4 5 


Arg 


Lvs 


Ser 


Gly 


Lys 










60 


Lys 


Ser 


Arg 


He 


Tro 

1 X ^ 










7 5 


Ala 


Arg 


Ala 


His 


Asp 










90 


Ala 


Cys 


Leu 


Asn 


Phe 










105 


Glu 


Thr 


Thr 


Cys 


Ala 










120 


Ala 


Leu 


Ala 


Phe 


Glu 










135 


Arg 


Gly 


Met 


Asn 


Met 










150 


Glu 


Leu 


Asn 


Asp 


Thr 










165 


Thr 


Met 


Val 


Glu 


Ala 










180 


Tyr 


Met 


Ala 


Glu 


Glu 










195 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 



(V) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 
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(G) CELL TYPE 

(H) CELL LINE 
C I } ORGANELLE 



N/A 
N/A 
N/A 



50 
100 
150 
200 
250 
300 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 
(A) NAME/KEY: brCBF4 gene 
(3) LOCATION: 

(C) IDENTIFICATION METHOD: sequencinq 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-80* 
ACCGCTCGAG TACTTACTAT ACTACACTCA GCCTTATCCA GTTTTTCTTC 

CAACGATGGA CTCAATCTCT ACTTTTCCTG AACTGCTTGG CTCAGAGAAC 
GAGTCTCCGG TTACTACGGT AGTAGGAGGT GATTATTGTC CCAGGTTGGC 
GGCAAGCTGT CCGAAGAAGC CAGCGGGTAG GAAGAAGTTT CAGGAGACAC 
GTCACCCCAT TTACCGTGGA GTTCGTTTAA GAAAGTCCGG TAAGTGGGTG 
TGTGAAGTGA GGGAACCAAA CAAGAAATCT AGGATTTGGC TCGGAACTTT 
CAAAACCGCT GAGATCGCTG CTCGTGCTCA CGACGTTGCT GCCTTAGCCC 350 
TCCGCGGAAG AGGCGCCTGC CTCAACTTCG CCGACTCGGC TTGACGGCTC 4 00 

CGTATCCCGG AGACAACCTG CGCCAAGGAT ATCCAGAAGG CTGCTGCTGA 
AGCTGCATTG GCTTTTGAGG CCGAGAAGAG TGATCATGGC ATGAACATGA 
AGAATACTAC GGCGGrGGCT TCTCAGGTTG AGGTGAATGA TACGACGACG 5 50 

GACCATGGCG TGGACATGGA GGAGACGAGG GTGGAGGGTG TTGTTACGGA 600 
GGAACAGAAC AATTGGTTTT ACATGGACGA GGAGTGGATG TTTGGGATGC 650 
CGACGTTGTT GGTTGATATG GCGGAAGGGA TGCTTATACC GCGGCAGTCC 7 00 

GTACAATCGG GACACTACGA TGACTTCGAA GGAGATGCTG ACATGAACCT 
CTGGA 

(2) INFORMATION FOR SEQ ID NO : 8 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
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450 
500 



750 
755 
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(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 
(!) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME; N/A 



(ix) FEATURE: 

(A) NAME/KEY: BRCBF4-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequenci 

(D) OTHER INFORMATION: 



(X) PUBLICATION INFORMATION: 





(xi) 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


Met 


Asp 


Ser 


lie 


Ser 


Thr 


Phe 


Pro 


Glu 


Leu 










5 










10 


Glu 


Ser 


Pro 


Val 


Thr 


Thr 


Val 


Val 


Gly 


Gly 










20 








25 


Leu 


Ala 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 










35 










40 


Gin 


Giu 


Thr 


Arg 


His 


Pro 


He 


Tyr 


Arg 


Gly 










50 








55 


Ser 


Gly 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 










65 








70 


Arg 


lie 


Trp 


Leu 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 










80 








85 


Ala 


His 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 










95 










100 


Leu 


Asn 


Phe 


Ala 


Asp 


Ser 


Ala 
















110 













15 
Arg 

30 
Phe 

45 
Lys 

60 
Ser 

75 
Arg 

90 
Cys 
105 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 
(3) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 



(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 
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(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: brCBFS gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: 
ACCGCTCGAG TACTTACTAT ACTACACTCA 

CAACGATGGA CTCAATCTCT ACTTTTCCTG 

GAGTCTCCGG TTACTACGGT AGTAGGAGGT 

GGCAAGCTGT CCGAAGAAGC CAGCGGGTAG 

GTCACCCCAT TTACCGTGGA GTTCGTTTAA 

TGTGAAGTGA GGGAACCAAA CAAGAAATCT 

CAAAACCGCT GAGATCGCTG CTCGTGCTCA 

TCCGCGGAAG AGGCGCCTGC CTCAACTTCG 

CGTATCCCGG AGACAACCTG CGCCAAGGAT 

AGCTGCTTTG GCTTTTGAGG CCGAGAAGAG 

AGAATACTAC GGCGGTGGCT TCTCAGGTTG 

GACCATGGCG TGGACATGGA GGAGACGTTG 

GGAACAGAGA GAAGGGTTTT ACATGACGGA 

TTGTTACGGA GGAACAGAAC AATTGGTTTT 

TTTGGGATGC CGACGTTGTT GGTTGATATG 

GCGGCAGTCC GTACAATCGG GACACTACGA 

ACATGAACCT CTGGAATTAT TAGGGATCCG 

{2) INFORMATION FOR SEQ ID NO: 8: 



SEQ ID NO:82: 

GCCTTATCCA GTTTTTCTTC 5 0 

AACTGCTTGG CTCAGAGAAC 100 

GATTATTGTC CCAGGTTGGC 150 

GAAGAAGTTT CAGGAGACAC 2 00 

GAAAGTCCGG TAAGTGGGTG 250 

AGGATTTGGC TCGGAACTTT 300 

CGACGTTGCT GCCTTAGCCC 350 

CCGACTCGGC TTGGCGGCTC 4 00 

ATCCAGAAGG CTGCTGCTGA 4 50 

TGATCATGGC ATGAACATGA 500 

AGGTGAATGA TACGACGACG 5 50 

GTGGAGGCTG TTTTTACGGA 600 

GGAGACGAGG GTGGAGGGTG 650 

ACATGGACGA GGAGTGGATG 700 

GCGGAAGGGA TGCTTATACC 7 50 

TGACTTCGAA GGAGATGCTG 8 00 

CG 832 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE; N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BRCBF5-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



Met 


Asp 


Ser 


He 


Ser 


Thr 


Phe 


Pro 


Glu 


Leu 


Leu 


Gly 


Ser 


Glu 


Asn 










5 










10 










15 


Glu 


Ser 


Pro 


Val 


Thr 


Thr 


Val 


Val 


Gly 


Gly 


Asp 


Tyr 


Cys 


Pro 


Arg 










20 










25 










30 


Leu 


Ala 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 










35 










40 








45 


Gin 


Glu 


Thr 


Arg 


His 


Pro 


He 


Tyr 


Arg 


Gly 


Vai 


Arg 


Leu 


Arg 


Lys 










50 










55 








60 


Ser 


Gly 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 










65 










70 






75 


Arg 


He 


Trp 


Leu 


Gly 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


He 


Ala 


Ala 


Arg 


Ala 








80 










85 










90 


His 


Asp 


Val 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 










95 










100 










105 


Leu 


Asn 


Phe 


Ala 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 










110 










115 










120 


Thr 


Cys 


Ala 


Lys 


Asp 


He 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 










125 










130 










135 


Ala 


Phe 


Glu 


Ala 


Glu 


Lys 


Ser 


Asp 


His 


Gly 


Met 


Asn 


Met 


Lys 


Asn 










140 










145 








150 


Thr 


Thr 


Ala 


Val 


Ala 


Ser 


Gin 


Val 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Thr 










155 










160 








165 


Asp 


His 


Gly 


Val 


Asp 


Met 


Glu 


Glu 


Thr 


Leu 


Vai 


Glu 


Ala 


Val 


Phe 










170 










175 










180 


Thr 


Glu 


Glu 


Gin 


Arg 


Glu 


Gly 


Phe 


Tyr 


Met 


Thr 


Glu 


Glu 


Thr 


Arg 










185 










190 










195 


Vai 


Glu 


Gly 


Val 


Val 


Thr 


Glu 


Glu 


Gin 


Asn 


Asn 


Trp 


Phe 


Tyr 


Met 
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200 205 210 

Asp Glu Glu Trp Met Phe Gly Met Pro Thr Leu Leu Val Asp Met 

215 220 225 

Ala Glu Gly Met Leu He Pro Arg Gin Ser Val Gin Ser Gly His 

230 235 240 

Tyr Asp Asp Phe Glu Gly Asp Ala Asp Met Asn Leu Trp Asn Tyr 

245 250 255 

C2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 

(B) TYPE: Nucleic Acid 
{C} STRANDEDNESS : Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: brCBF6 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFOE^ATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

TACTACACTC AGCCTTATCC AGTTTTCAAA AAAAGTATTC AACTATGAAC 50 

TCAGTCTCTA CTTTTTCTGA ACTGCTCTGC TCCGAGAACA AGTCTCCGGT 100 

TAATACGGAA GGTGGTGATT ACATTTTGGC GGCGAGCTGT CCCAAGAAAC 150 

CTGCTGGTAG GAAGAAGTTT CAGGAGACAC GCCACCCCAT TTACAGAGGA 2 00 

GTTCGCCTAA GAAAGTCAGG TAAGTGGGTG TGTGAAGTGA GGGAACCAAA 2 50 

CAAGAAATCT AGAATTTGGC TCGGAACTTT CAAAACAGCT GAGATAGCAG 3 00 

CTCGTGCTCA CGACGTCGCC GCCTTAGCTC TCCGTGGAAG AGGCGCCTGC 3 5 0- 
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CTCAACTTCG 


CCGACTCGGC 


TTGGCGGCTC 


CGTATCCCAG 


AGACAArPTn 


A r\c\ 


CGCCAAGGAT 


ATCCAGAAGG 


CTGCTGCTGA 


AGCCGCATTG 


GCTTTTGAf^r^ 

«^ A. X X J. VJ^^VJO 


O U 


CCGAGAAGAG 


TGATACCACG 


ACGAATGATC 


GTGGCATGAA 




c; n 
D U U 


ACGTCCGCGG 


TGGCTTCTCC 


GGCTGAGTTG 


AATGATACGA 




c o 
o 0 U 


TGGCCTGGAC 


ATGGAGGAGA 


CGATGGTGGA 


GGCTGTTTTT 




ri n 
D U U 


AGAGAGAAGG 


GTTTTACATG 


GCGGAGGAGA 


CGACGGTGnA 




c: ri 
D 3 U 






r\\j<j<j L i i i AL. 


AiGGACGAGG 


AGTGGACGTT 


700 


CGAGATGCCG 


AGGTTGTTGG 


CTGATATGGC 


GGAAGGGATG 


CTTCTGCCTC 


750 


CCCCGTCCGT 


ACAATGGGGA 


CATAACGATG 


ACTTCGAAGG 


AGATGCTGAC 


800 


ATGAACCTCT 


GGAATTATTA 


GGGATCCGCG 






830 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 

(B) TYPE: Amino Acid 

CO STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 
CG) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BRCBF6-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 5 : 
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Met 


Asn 


Ser 


Val 


Ser 
5 


Thr 


Phe 


Lys 


Ser 


Pro 


Val 


Asn 


Thr 


Glu 










20 






Ser 


Cys 


Pro 


Lvs 


Lys 


Pro 


Ala 










35 






Arg 


His 


Pro 


lie 


Tyr 


Arg 


Gly 










50 




Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 










65 






Leu 


Gly 


Thr 


Phe 


Lys 
80 


Thr 


Ala 


Vai 


Ala 


Ala 


Leu 


Ala 


Leu 


Arg 










95 




Ala 


Asp 


Ser 


Ala 


Trp 


Arg 


Leu 










110 






Lys 


Asp 


lie 


Gin 


Lys 


Ala 


Ala 










125 






Ala 


Glu 


Lys 


Ser 


Asp 


Thr 


Thr 










140 






Glu 


Glu 


Thr 


Ser 


Ala 


Val 


Ala 










155 






Thr 


Ala 


Asp 


His 


Gly 


Leu 


Asp 










170 




Val 


Phe 


Arg 


Asp 


Glu 


Gin 


Arg 










185 




Thr 


Thr 


Val 


Glu 


Gly 


Val 


Val 










200 






t'he 


Tyr 


Met 


Asp 


Glu 


Glu 


Trp 










215 




Ala 


Asp 


Met 


Ala 


Glu 


Gly 


Met 










230 




Trp 


Gly 


His 


Asn 


Asp 


Asp 


Phe 










245 






Trp 


Asn 


Tyr 











Ser 


Glu 


Leu 


Leu 


Cys 


Ser 


Glu 


Asn 






10 










15 


Gly 


Giy 


Asp 


Tyr 


He 


Leu 


Ala 


Ala 






25 










30 


Gly 


Arg 


Lys 


Lys 


Phe 


Gin 


Glu 


Thr 






40 










45 


Val 


Arg 


Leu 


Arg 


Lys 


Ser 


Gly 


Lys 






55 








60 


Pro 


Asn 


Lys 


Lys 


Ser 


Arg 


He 


Trp 






70 










75 


Glu 


lie 


Ala 


Ala 


Arg 


Ala 


His 


Asp 






85 










90 


Gly 


Arg 


Gly 


Ala 


Cys 


Leu 


Asn 


Phe 






100 










105 


Arg 


He 


Pro 


Glu 


Thr 


Thr 


Cys 


Ala 






115 








120 


Ala 


Glu 


Ala 


Ala 


Leu 


Ala 


Phe 


Glu 






130 










135 


Thr 


Asn 


Asp 


Arg 


Gly 


Met 


Asn 


Met 






145 










150 


Ser 


Pro 


Ala 


Glu 


Leu 


Asn 


Asp 


Thr 






160 








165 


Met 


Glu 


Glu 


Thr 


Met 


Val 


Glu 


Ala 






175 










180 


Glu 


Gly 


Phe 


Tyr 


Met 


Ala 


Glu 


Glu 






190 










195 


Pro 


Glu 


Glu 


Gin 


Met 


Ser 


Lys 


Gly 






205 








210 


Thr 


Phe 


Glu 


Met 


Pro 


Arg 


Leu 


Leu 






220 








225 


Leu 


Leu 


Pro 


Pro 


Pro 


Ser 


Val 


Gin 






235 










240 


Glu 


Gly 


Asp 


Ala 


Asp 


Met 


Asn 


Leu 






250 










255 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rapa 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 



86/101 



BNSDOCID: <WO 9938977A2_L> 



wo 99/38977 



PCT/US99/01895 



(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

{ix} FEATURE: 

(A) NAME/KEY: brCBF7 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO • 8 6 • 
CTATACTACA CACAGCCTTA TCCAGCCGCT CGAGTACTTA CTATACTACA 

CTCAGCCTTT TCCAGTTTTT CAAAAGAAGT TTTCAACGAT GAACTCAGTC 
TCTACTCTTT CTGAAGTTCT TGGCTCCCAG AACGAGTCTC CCGTAGGTGG 
TGATTACTGT CCCATGTTGG CGGCGAGCTG TCCGAAGAAG CCGGCGGGTA 
GGAAGAAGTT TCGGGAGACA CGTCACCCCA TTTACAGAGG AGTTCGTCTT 
AGAAAGTCAG GTAAGTGGGT GTGTGAAGTG AGGGAACCAA ACAAGAAATC 
TAGGATTTGG CTCGGAACTT TCAAAACAGC TGAGATCGCA GCTCGTGCTC 350 
ACGACGTTGC CGCCTTAGCT CTCCGTGGAA GAGGCGCCTG CCTCAACTTC 4 00 

GCCGACTCGG CTTGGCGGCT CCGTATCCCG GAGACAACCT GCGCCAAGGA 4 50 

TATCCAGAAG GCTGCTGCTG AAGCCGCATT GGCTTTTGAG GCGGAGAAGA 
GTGATACCAC GACGACGAAT GATCATGGCA TGAACATGGC TTCTCAGGTT 
GAGGTTAATG ACACGACGGA TCATGACCTG GACATGGAGG AGACGATGGT 
GGAGGCTGTT TTTAGGGAGG AACAGAGAGA AGGGTTTTAC ATGGCGGAGG 65 0 

AGACGACGGT GGAGGGTATT GTTCCGGAGG AACAGATGAG CAAAGGGTTT 7 00 

TACATGGACG AGGAGTGGAT GTTCGGGATG CCGACCTTGT TGGCTGATAT 
GGCGGCAGGG ATGCTCTTAC CGCCGCCGTC CGTACAATGG GGACATAATG 
ATGACTTCGA AGGAGATGCT GACATGAACC TCTGGAATTA TTAAGGGATC 8 50 

CGCG 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHAEU^CTERI STICS : 

(A) LENGTH: 251 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 



50 
100 
150 
200 
250 
300 



500 
550 
600 



750 
800 



854 
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(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica rape 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 
CD ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: BRCBF7-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



Met 


Asn 


Ser 


Val 


Ser 


Thr 


Leu 


Ser 


Glu 


Val 


Leu 


Gly 


Ser 


Gin 


Asn 


Glu 








5 










10 








15 


Ser 


Pro 


Val 


Gly 
20 


Gly 


Asp 


Tyr 


Cys 


Pro 
25 


Met 


Leu 


Ala 


Ala 


Ser 
30 


Cys 


Pro 


Lys 


Lys 


Pro 


Ala 


Gly 


Arg 


Lys 


Lys 


Phe 


Arg 


Glu 


Thr 


Arg 


His 








35 










40 










45 


Pro 


He 


Tyr 


Arg 


Gly 


Val 


Arg 


Leu 


Arg 


Lys 


Ser 


Gly 


Lys 


Trp 










50 










55 








60 


Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 


Arg 


He 


Trp 


Leu 


Gly 








65 










70 








75 


Thr 


Phe 


Lys 


Thr 


Ala 


Glu 


He 


Ala 


Ala 


Arg 


Ala 


His 


Asp 


Val 


Ala 


Ala 






80 










85 








90 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Gly 


Ala 


Cys 


Leu 


Asn 


Phe 


Ala 


Asp 








95 










100 










105 


Ser 


Ala 


Trp 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Thr 


Thr 


Cys 


Ala 


Lys 


Asp 


He 






110 










115 








120 


Gin 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 


Ala 


Phe 


Glu 


Ala 


Glu 








125 










130. 










135 


Lys 


Ser 


Asp 


Thr 


Thr 


Thr 


Thr 


Asn 


Asp 


His 


Gly 


Met 


Asn 


Met 


Ala 


Ser 






140 










145 








150 


Gin 


Val 


Glu 


Val 


Asn 


Asp 


Thr 


Thr 


Asp 


His 


Asp 


Leu 


Asp 


Met 


Glu 






155 










160 










165 


Glu 


Thr 


Met 


Val 


Glu 


Ala 


Val 


Phe 


Arg 


Glu 


Glu 


Gin 


Arg 


Glu 


Gly 






170 










175 










180 


Phe 


Tyr 


Met 


Ala 


Glu 


Glu 


Thr 


Thr 


Val 


Glu 


Gly 


He 


Val 


Pro 


Glu 






185 










190 










195 


Glu 


Gin 


Met 
200 


Ser 


Lys 


Gly 


Phe 


Tyr 
205 


Met 


Asp 


Glu 


Glu 


Trp 
210 


Met 


Phe 


Gly 


Met 


Pro 


Thr 


Leu 


Leu 


Ala 


Asp 


Met 


Ala 


Ala 


Gly 


Met 


Leu 








215 










220 








225 


Leu 


Pro 


Pro 


Pro 


Ser 


Val 


Gin 


Trp 


Gly 


His 


Asn 


Asp 


Asp 


Phe 
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230 235 240 

Glu Gly Asp Ala Asp Met Asn Leu Trp Asn Tyr 
245 250 

(2) INFORMATION FOR SEQ ID NO : 8 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: Glycine 


max 


(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(0) 


DEVELOPMENTAL STAGE 


: N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: gmCBFl gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
CATCCGATTT ATAGTGGCGT GAGGAGGAGG AACACGGATA AGTGGGTAAG 



TGAGGTGAGG GAGCCCAACA AAAAGACCAG GATTTGGCTG GGGACTTTTC 100 

CCACGCCGGA GATGGCGGCA CGGGCCCACG ACGTGGCGGC AATGGCCCTG 150 

AGGGGCCGGT ATGCCTGTCT CAACTTCGCT GACTCGACGT GGCGGTTACC 2 00 

AATTCCCGCC ACTGCTAACG CAAAGGATAT ACAGAAAGCA GCAGCAGAGG 2 5€ 

CTGCCGAGGC TTTCAGACCA AGTCAGACCT TAGAAAATAC GAATACAAAG 300 

CAAGAGTGTG TAAAAGTGGT GACGACAACA ACGATCACAG AAC7WWi.CG 3 50 

AGGAATGTTT TATACGGAGG AAGAAGAGCA AGTGTTAGAT ATGCCTGAGT 4 00 

TGCTTAGGAA TATGGTGCTT ATGTCCCCAA CACATTGCAT AGGGTATGAG 4 50 
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TATGAAGATG CTGACTTGGA TGCTCAAGAT GCTGAGGTGT CCCTATGGAG 
TTTCTCAATT TAATAACGTG CTTTTGGTTT GGTTTTTTAT GTTAGTTTTG 
GAGTGTGACT GTCTGTACTG GTTTTTTATT AGTAGTACGG ATACTAGCTA 
TAGGTGGCAG ATTGAAAGGG ACCAAAAGGA ATTTTCTTTT GAAACCCTTT 
TTGTCAAAGT AATCAATCGC GTATCATCAA GTGAATCCCT TGATCAAGTT 
TATGTATGAA TTAAATAAAA GAAGAATCTA GTTTTGGT 

(2) INFORMATION FOR SEQ ID NO : 8 9 : 

Ci) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 170 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

( iii ) HYPOTHETICAL : No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: GMCBFl-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencinq 

(D) OTHER INFORMATION: 4uencing 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

His Pro He Tyr Ser Gly Val Arg Arg Arg Asn Thr Asp Lys Trp 

Val Ser Glu Val Arg Glu Pro Asn Lys Lys Thr Arg He Trp Le! 

Gly Thr Phe Pro Thr Pro Glu Met Ala a" Arg Ala His Asp vll 

Ala Ala Met Ala Leu Arg Gly Arg Tyr aII Cys Leu Asn Phe aL' 

55 60 



500 
550 
600 
650 
700 
738 
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Asp Ser 
Asp He 
Ser Gin 
Val Val 
Tyr Thr 
Arg Asn 
Tyr Glu 
Trp Ser 



Thr Trp 
Gin Lys 
Thr Leu 
Thr Thr 
Glu Glu 
Met Val 
Asp Ala 
Phe Ser 



Arg Leu 
65 

Ala Ala 

80 
Glu Asn 

95 

Thr Thr 
110 

Glu Glu 
125 

Leu Met 
140 

Asp Leu 
155 
He 
170 



Pro He Pro Ala 

70 

Ala Glu Ala Ala 
85 

Thr Asn Thr Lys 
100 

He Thr Glu Gin 
115 

Gin Val Leu Asp 
130 

Ser Pro Thr His 
145 

Asp Ala Gin Asp 
160 



Thr Ala Asn Ala Lys 
75 

Glu Ala Phe Arg Pro 
90 

Gin Glu Cys Val Lys 
105 

Lys Arg Gly Met Phe 
120 

Met Pro Glu Leu Leu 
135 

Cys He Gly Tyr Glu 
150 

Ala Glu Val Ser Leu 
175 



(2) INFORMATION FOR SEQ ID NO:90: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 793 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Raphanus sativus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 
O) DEVELOPMENTAL STAGE: N/A 
(S) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: rsCBFl gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencinq 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO-90* 

ACTACACTCA GCCTTATCCA GTTTTTCTTC CAACGATGGA CTCAATCTCT 50 

ACTTTTTCTG AACTGCTTGG CTCCGAGAAC GAGTCTCCGG TTACTACGGT IQO 

AGTAGGAGGT GATTATTTTC CCAGGTTGGC GGCAAGCTGT CCGAAGAAGC 150 
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CAGCGGGTAG GAAGAAGTTT CAGGAGACAC GTCACCCCAT TTACCGCGGA 
GTTCGTTTAA GAAAGTCAGG TAAGTGGGTG TGTGAAGTGA GGGAACCAAA 
CAAGAAATCT AGGATTTGGC TCGGAACTTT CAAAACCGCT GAGATCGCTG 
CTCGTGCTCA CGACGTTGCT GCCTTAGCCC TCCGCGGAAG AGGCGCCTGC 
CTCAACTTCG CCGACTCGGC TTGGCGGCTC CGTATCCCGG AGACAACCTG 
CGGCAAGGAT ATCCAGAAGG CTGCTGCTGA AGCTGCATTG GCTTTTGAGG 
CCGAGAAGAG TGATCATGGC ATGAACATGA AGAATACTAC GGCGGTGGCT 
TCTCAGGTTG AGGTGAATGA CACGACGACG GACCATGGCG TGGACATGGA 
GGAGACGTTG GTGGAGGCTG TTTTTACGGA GGAACAGAGA GAAGGGTTTT 
ACATGACGGA GGAGACGAGG GTGGAGGGTG TTGTTACGGA GGAACAGAAC 
AATTGGTTTT ACATGGACGA GGAGTGGATG TTTGGGATGC CGACGTTGTT 
GGTTGATATG GCGGAAGGGA TGCTTTTACC GCGGCCGTCC GTACAATCGG 
GACACTACGA TGACTTCGAA GGAGATGCTG ACATGAACCT CTG 

(2) INFORMATION FOR SEQ ID NO : 9 1 : 

(i) SEQUENCE CHARACTERISTTCS • 

(A) LENGTH: 252 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Raphanus sativus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: RSCBFl-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 
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(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Met Asp Ser He Ser Thr Phe Ser Glu Leu Leu Gly Ser Glu Asn 

5 10 15 

Glu Ser Pro Val Thr Thr Val Val Gly Gly Asp Tyr Phe Pro Arg 
20 25 30 

Leu Ala Ala Ser Cys Pro Lys Lys Pro Ala Gly Arg Lys Lys Phe 
35 40 45 

Gin Glu Thr Arg His Pro He Tyr Arg Gly Val Arg Leu Arg Lys 
50 55 gQ 

Ser Gly Lys Trp Val Cys Glu Val Arg Glu Pro Asn Lys Lys Ser 
65 70 75 

Arg He Trp Leu Gly Thr Phe Lys Thr Ala Glu He Ala Ala Arq 
80 85 90 

Ala His Asp Val Ala Ala Leu Ala Leu Arg Gly Arg Gly Ala Cys 
95 100 105 

Leu Asn Phe Ala Asp Ser Ala Trp Arg Leu Arg He Pro Glu Thr 
110 115 ^20 

Thr Cys Ala Lys Asp He Gin Lys Ala Ala Ala Glu Ala Ala Leu 
125 130 135 

Ala Phe Glu Ala Glu Lys Ser Asp His Gly Met Asn Met Lys Asn 

140 145 
Thr Thr Ala Val Ala Ser Gin Val Glu Val Asn Asp Thr Thr Thr 
155 160 165 

Asp His Gly Val Asp Met Glu Glu Thr Leu Val Glu Ala Val Phe 
1'70 175 180 

Thr Glu Glu Gin Arg Glu Gly Phe Tyr Met Thr Glu Glu Thr Arg 

185 190 
Val Glu Gly Val Val Thr Glu Glu Gin Asn Asn Trp Phe Tyr Met 
200 205 210 

Asp Glu Glu Trp Met Phe Gly Met Pro Thr Leu Leu Val Asp Met 
215 220 225 

Ala Glu Gly Met Leu Leu Pro Arg Pro Ser Val Gin Ser Gly His 
230 235 240 

Tyr Asp Asp Phe Glu Gly Asp Ala Asp Met Asn Leu 
245 250 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Raphanus sativus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 
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(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: rsCBF2 gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencina 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-92- 
ACACCTAAAC CTTATCCAGG TTTAACTTTT TTTTTCATAA AGAGTTTTCA 

ACAATGACCA CATTTTCTAC CTTTTCCGAA ATGTTGGGCT CCGAGTACGA 

GTCTCCGGTT ACATTAGGCG GAGAGTATTG TCCGAAGCTG GCCGCGAGCT 

GTCCGAAGAA ACCAGCTGGT CGTAAGAAGT TTCGAGAGAC GCGCCACCCA 

ATATACAGAG GAGTTCGTCT GAGAAACTCA GGTAAGTGGG TGTGTGAAGT 

GAGGGAGCCA AACAAGAAAT CTAGGATTTG GCTCGGTACT TTCCTAACCG 

CCGAGATCGC AGCGCGTGCC CACGACGTCG CCGCCATAGC CCTCCGCGGC 

AAATCCGCAT GTCTCAATTT CGCTGACTCG GCTTGGCGGC TCCGTATCCC 

GGAGACAACA TGCCCCAAGG ATATACAGAA GGCGGCTGCT GAAGCCGCGG 

TGGCTTTTCA GGCTGAGATA AATGATACGA CGACGGATCA TGGCCTGGAC 

TTGGAGGAGA CGATCGTGGA GGCTATTTTT ACGGAGGTAA ACAACGATGA 

GTTTTATATG GACGAGGAGT CCATGTTCGG GATGCCGTCT TTGTTGGCTA 

GTATGGCGGA AGGGATGCTT TTGCCGCTGC CGTCCGTACA ATCTGAACAT 

AACTGTGACT TCGACGGAGA TGCTGACATG AA 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 209 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 
( iii ) HYPOTHETICAL : No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Raphanus sativus 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 

(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: RSCBF2-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



Met 


Thr 


Thr 


Phe 


Ser 
5 


Thr 


Phe 


Ser 


Glu 


Met 
10 


Leu 


Gly 


Ser 


Giu 


Tyr 
15 


Glu 


Ser 


Pro 


Val 


Thr 
20 


Leu 


Gly 


Gly 


Giu 


Tyr 
25 


Cys 


Pro 


Lys 


Leu 


Ala 
30 


Ala 


Ser 


Cys 


Pro 


Lys 


Lys 


Pro 


Aia 


Gly 


Arg 


Lys 


Lys 


Phe 


Arg 


Glu 










35 










40 








45 


Thr 


Arg 


His 


Pro 


He 

50 


Tyr 


Arg 


Gly 


Val 


Arg 

55 


Leu 


Arg 


Asn 


Ser 


Gly 
60 


Lys 


Trp 


Val 


Cys 


Glu 


Val 


Arg 


Glu 


Pro 


Asn 


Lys 


Lys 


Ser 


Arg 


lie 










65 










70 








75 


Trp 


Leu 


Gly 


Thr 


Phe 


Leu 


Thr 


Ala 


Giu 


He 


Ala 


Aia 


Arg 


Aia 


His 










80 










85 








90 


Asp 


Val 


Aia 


Ala 


lie 


Ala 


Leu 


Arg 


Gly 


Lys 


Ser 


Aia 


Cys 


Leu 


Asn 










95 










100 








105 


Phe 


Aia 


Asp 


Ser 


Aia 
110 


Trp 


Arg 


Leu 


Arg 


He 
115 


Pro 


Glu 


Thr 


Thr 


Cys 
120 


Pro 


Lys 


Asp 


lie 


Gin 
125 


Lys 


Ala 


Aia 


Ala 


Giu 
130 


Ala 


Ala 


Val 


Ala 


Phe 
135 


Gin 


Aia 


Giu 


lie 


Asn 


Asp 


Thr 


Thr 


Thr 


Asp 


His 


Gly 


Leu 


Asp 


Leu 










140 










145 








150 


Giu 


Glu 


Thr 


He 


Val 
155 


Glu 


Aia 


He 


Phe 


Thr 
160 


Giu 


Val 


Asn 


Asn 


Asp 
165 


Giu 


Phe 


Tyr 


Met 


Asp 


Giu 


Giu 


Ser 


Met 


Phe 


Gly 


Met 


Pro 


Ser 


Leu 










170 










175 








180 


Leu 


Ala 


Ser 


Met 


Aia 
185 


Giu 


Gly 


Met 


Leu 


Leu 
190 


Pro 


Leu 


Pro 


Ser 


Val 
195 


Gin 


Ser 


Giu 


His 


Asn 


Cys 


Asp 


Phe 


Asp 


Gly 


Asp 


Ala 


Asp 


Met 





200 205 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea maize 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 
(E) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 



(ix) FEATURE: 

(A) NAME/KEY: zmCBFl gene 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequenci 

(D) OTHER INFORMATION: 



(x) PUBLICATION INFORMATION: 



(xi) SEQUENCE DESCRIPTION: 3EQ ID NO-94- 
GGAGTCCGC GGACGGCGGC GGCGGCGGCG ACGACGAGTA CGCGACGGTG 

CTGTCGGCGC CACCCAAGCG GCCGGCGGGG CGGACCAAGT TCCGGGAGAC 

GCGGCACCCC GTGTACCGCG GCGTGCGGCG GCGCGGGCCC GCGGGGCGCT 

GGGTGTGCGA GGTCCGCGAG CCCAACAAGA AGTCGCGCAT CTGGCTCGGC 

ACCTTCGCCA CCCCCGAGGC CGCCGCGCGC GCGCACGACG TGGCCGCGCT 

GGCCCTGCGG GGCCGCGCCG CGTGCCTCAA CTTCGCCGAC TCGGCGCGCC 

TGCTCCAAGT CGACCCCGCC ACGCTCGCCA CCCCCGACGA CATCCGCCG 

(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea maize 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 



(G) CELL TYPE 

(H) CELL LINE 

(I) ORGANELLE 



N/A 
N/A 
N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: ZMCBFl-PEP 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: sequencing 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
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Gly 








55 
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Ala 








70 










75 


Ala 


Leu 


Ala 


Leu 


Arg 


Gly 


Arg 


Ala 


Ala 


Asp 








85 
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(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 39 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 
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(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GGAAGATCTA TGAAACAGAG TACTCTGATC AATGAACTC 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 37 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GGAAGATCTG AAACAGAGTA CTCTGATCAA TGAACTC 37 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 
{iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: 




(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GGAAGATCTA TGAACAGAGT ACTCTGATCA ATGAACTC 38 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
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FRAGMENT TYPE: 

ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: N/A 

(D) DEVELOPMENTAL STAGE: N/A 

(E) HAPLOTYPE: N/A 

(F) TISSUE TYPE: N/A 

(G) CELL TYPE: N/A 

(H) CELL LINE: N/A 

(I) ORGANELLE: N/A 



(vii) IMMEDIATE SOURCE: N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GGAAGATCTA TGAACAGAGT ACTCTGATGC AATGAACTC 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 

(B) TYPE: Nucleic Acid Sequence 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DMA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 



(A) 


ORGANISM: 




(B) 


STRAIN: 




(C) 


INDIVIDUAL ISOLATE: 


N/A 


(D) 


DEVELOPMENTAL STAGE: 


N/A 


(E) 


HAPLOTYPE: N/A 




(F) 


TISSUE TYPE: N/A 




(G) 


CELL TYPE: N/A 




(H) 


CELL LINE: N/A 




(I) 


ORGANELLE: N/A 





(vii) IMMEDIATE SOURCE; N/A 

(viii) POSITION IN GENOME: N/A 

(ix) FEATURE: 
(A) NAME/KEY: 



(V) 

(vi) 
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(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GGAGGATCCT CGTTTCTACA ACAATAAAAT AAAATAAAAT GAAGGAACC 
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which regulates expression of the gene encoding the binding protein. A method for altering an environmental stress tolerance of a plant is 
aLso provided which comprises the steps of transforming a plant with a promoter which regulates expression of at least one copy of a gene 
encoding a binding protein capable of binding to a DNA regulatory sequence which regulates one or more environmental stress tolerance 
genes in the plant; expressing the binding protein encoded by the gene; and stimulating expression of at least one environmental stress 
tolerance gene through binding of the binding protein to the DNA regulatory sequence. 
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1, Claims: 1-30 all partially 

A binding protein other than CBF-1 in isolated form 
comprising a consensus sequence sufficiently homologous to 
or being the consensus sequence shown in Figure 19A, 19B» or 
19C, that the protein is capable of binding to a CCG 
regulatory sequence. Said protein comprising a sequence 
selected from the group consisting of an AP2 domain of SEQ 
ID Nos, 13 and 15. 

A non-natural ly occuring binding prptein comprising an ammo 
acid sequence capable of binding to a CCG regulatory 
sequence and an amino acid sequence which forms a 
transcription activation region. 

ONA encoding said proteins, said proteins recombinantly 
expressed within a plant, nucleic acid constructs and 
expression constructs comprising said DNA and a promoter 
being tissue specific, e.g. flower, fruit, or seed specific 
in plants, as well as plant material comprising said subject 
matter. 

A method for altering an environmental stress tolerance of a 
plant by expressing said proteins in a plant and thereby 
stimulating the expression of at least one environmental 
stress tolerance gene through binding of said binding 
protein to a DNA regulatory sequence comrpising CCG which 
regulates the expression of one or more environmental stress 
tolerance genes in the plant, optionally expressing the 
protein at levels and under conditions at which the plant 
does not express the binding protein in the plant's native 
state. 



2. Claims: 1-3G all partially 

idem for SEQ ID NOs: 39,41,43,45 



3. Claims: 1-30 all partially 

idem for SEQ ID N0s:47,49,51,53,55,57,59,61,63 



4. Claims: 1-30 all partially 

idem for SEQ ID N0s:65,67,69,71,73 



5. Claims: 1-30 all partially 

* idem for SEQ ID M0s:75,77,79,81,83,85,87 



6. Claims: 1-30 all partially 
idem for SEQ ID NO: 89 
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7, Claims: 1-30 all partially 

idem for SEQ ID N0s:91,93 

8. Claims: 1-30 all partially 

idem for SEQ ID N0:95 
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